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DECISION AND ORDER ON PRELIMINARY 
AND MISCELLANEOUS MOTIONS AND REQUESTS 



tEDDY S. GRON, Administrative Patent Judge . 
1 . Background 

September 26, 1983 - Michael J. Adang and John D. Kemp filed 
U.S. Application 06/535,354, entitled "Insect Resistant Plants" 
(assignment to Agrigenetics Research Associates Ltd. recorded 
September 20, 1983; assignment to Lubrizol Genetics Inc. recorded 
May 29 1986), now abandoned. 

April 4, 1986 - Michael J. Adang and John D. Kemp filed 
U.S. Application 06/848,733, entitled "Insect Resistant Plants" 
(assignment to Lubrizol Genetics Inc. recorded June 25, 1986) , 
now abandoned, which is claimed to be a continuation-in-part of 
U.S. Application 06/535,354, filed September 26, 1983. 

September 9, 1988 - Michael J. Adang, Thomas A Rocheleau, 
Donald J. Merlo and Elizabeth E. Murray filed U.S. Application 
07/242, 482T'entitled "Synthetic Insecticidal Crystal Protein 
Gene" (assignment to Lubrizol Genetics Inc. recorded October 24, 
1988), now abandoned, which is claimed to be a continuation-in- 
part of U.S. Application 06/848,733, filed April 4, 1986, which 
is claimed to be a continuation-in-part of U.S. Application 
06/535,354, filed September 26, 1983. 
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February 24, 1989 - David A. Fischhoff and Frederick J. 
Perlak filed U.S. Application 07/315,355, entitled "Synthetic 
Slant Genes And Method For Preparation" (assignment to Monsanto 
Company recorded February 24, 1989), now abandoned. 

August 7, 1989 - Kenneth A. Barton and Michael J. Miller 
filed U.S. Application 07/390,561, entitled "Expression of Genes 
In Plants" (assignment to Agracetus recorded August 7, 1989; 
assignment to Monsanto Company recorded October 15, 1996) , now 
abandoned. 

February 12, 1990 - David A. Fischhoff and Frederick J. 
Perlak filed U.S. Application 07/476,661, entitled "Synthetic 
Plant Genes And Method For Preparation" (assignment to Monsanto 
Company recorded February 12, 1990), now abandoned, which is 
claimed to be a continuation-in-part of U.S. Application 
07/315,355, filed February 24, 1989. 

January 28, 1992 - Michael J. Adang, Thomas A Rocheleau, 
Donald J. Mea?to--and Elizabeth E. Murray filed U.S. Application 
07/827,844, entitled "Synthetic Insecticidal Crystal Protein 
Gene" (assignment to Mycogen Plant Science, Inc., recorded 
April 1, 1993), now abandoned, which is claimed to be a 
continuation-in-part of U.S. Application 07/242,482, filed 
September 9, 1988, which is claimed to be a continuation-in-part 
of U.S. Application 06/848,733, filed April 4, 1986, which is 
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claimed to be a continuation-in-part of U.S. Application 
06/535,354, filed September 26, 1983. 

> r January 30, 1992 - Kenneth A. Barton and Michael J. Miller 
filed involved U.S. Application 07/827,906, entitled "Improved 
Expression of Genes In Plants" (assignment to Monsanto Company 
recorded October 15, 1996; assignment to Monsanto Technology LLC 
recorded June 13, 2001), which is claimed to be a continuation of 
U.S. Application 07/390,561, filed August 7, 1989. 

October 9, 1992 - David A. Fischhoff and Frederick J. Perlak 
filed U.S. Application 07/959,506, entitled "Synthetic Plant 
Genes'' (assignment to Monsanto Technology LLC recorded June 13, 
2001), which is claimed to be a continuation of U.S. Application 
07/476,661, filed February 12, 1990, which is claimed to be a 
continuation-in-part of U.S. Application 07/315,355, filed 
February 24, 1989. 

May 3, 1993 - Michael J. Adang, Thomas A Rocheleau, Donald 
J. Merlo ar*d-- : 'E±izabeth E. Murray filed U.S. Application 
08/057,191, entitled "Synthetic Insecticidal Crystal Protein 
Gene" (assignment to Mycogen Plant Science, Inc., recorded 
April 1, 1993), which is claimed to be a continuation of U.S. 
Application 07/827,844, filed January 28, 1992, which is claimed 
to be a continuation-in-part of U.S. Application 07/242,482, 
filed September 9, 1988, which is claimed to be a continuation- 
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in-part of U.S. Application 06/848,733, filed April 4, 1986, 
which is claimed to be a continuation-in-part of U.S. Application 
^6/535,354, filed September 26, 1983. 

January 6, 1995 - Michael J. Adang, Thomas A Rocheleau, 
Donald J. Merlo and Elizabeth E. Murray filed U.S. Application 
08/369,839, entitled "Synthetic Insecticidal Crystal Protein 
Gene," which is claimed to be a division of 08/057,191, filed 
May 3, 1993, which is claimed to be a continuation of U.S. 
Application 07/827,844, filed January 28, 1992, which is claimed 
to be a continuation-in-part of U.S. Application 07/242,482, 
filed September 9, 1988, which is claimed to be a continuation- 
in-part of U.S. Application 06/848,733, filed April 4, 1986, 
which is claimed to be a continuation-in-part of U.S. Application 
06/535,354, filed September 26, 1983. 

January 6, 1995 - Michael J. Adang, Thomas A Rocheleau, 
Donald J. Merlo and Elizabeth E. Murray filed U.S. Application 
08/369, 8 3 5, ^.entitled "Synthetic Insecticidal Crystal Protein 
Gene" (assignment to Mycogen Plant Science, Inc., recorded 
April 1, 1993), which is claimed to be a continuation-in-part 
of 08/057,191, filed May 3, 1993, which is claimed to be a 
continuation of U.S. Application 07/827,844, filed January 28, 
1992, which is claimed to be a continuation-in-part of U.S. 
Application 07/242,482, filed September 9, 1988, which is claimed 
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to be a continuation-in-part of U.S. Application 06/848,733, 
filed April 4, 1986, which is claimed to be a continuation-in- 
part of U.S. Application 06/535,354, filed September 26, 1983. 

January 10, 1995 - involved U.S. Patent 5,380,831 issued 
from Michael J. Adang, Thomas A Rocheleau, Donald J. Merlo and 
Elizabeth E. Murray, U.S. Application 08/057,191, filed May 3, 
1993. 

May 3, 1995 - David A. Fischhoff and Frederick J. Perlak, 
filed involved U.S. Application 08/434,105 entitled "Synthetic 
Plant Genes And Method For Preparation" (assignment to Monsanto 
Technology LLC recorded June 13, 2001), which is claimed to be a 
divisional of U.S. Application 07/959,506, filed October 9, 1992, 
which is claimed to be a continuation of U.S. Application 
07/476,661, filed February 12, 1990, which is claimed to be a 
continuation-in-part of U.S. Application 07/315,355, filed 
February 24, 1989. 

May 19^1-996 - U.S. Patent 5, 500, 365 issued from David A. 
Fischhoff and Frederick J. Perlak, U.S. Application 07/959,506, 
filed October 9, 1992. 

August 29, 1996 - Michael J. Adang and Elizabeth E. Murray 
filed U.S. Application 08/705,438, entitled "Synthetic 
Insecticidal Crystal Protein Gene Having A Modified Frequency Of 
Codon Usage" (assigned to Mycogen Plant Science, Inc.), which is 
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claimed to be a division of U.S. Application 08/369,835, filed 
January 6, 1995 (now U.S. Patent 5,567,600), which is claimed to 
t^e a continuation-in-part of 08/057, 191, filed May 3, 1993 (now 
U.S. Patent 5,380,831), which is claimed to be a continuation of 
U.S. Application 07/827,844, filed January 28, 1992, which is 
claimed to be a continuation-in-part of U.S. Application 
07/242,482, filed September 9, 1988. 

August 29, 1996 - Michael J. Adang and Elizabeth E. 
Murray filed U.S. Application 08/704,966, entitled "Transgenic 
Plants Comprising Synthetic Insecticidal Crystal Protein Gene 
Having A Modified Frequency Of Codon Usage" (assigned to Mycogen 
Plant Science, Inc.), which is claimed to be a division of U.S. 
Application 08/369,839, filed January 6, 1995 (now U.S. Patent 
5,567,862), which is claimed to be a continuation-in-part of 
08/057,191, filed May 3, 1993 (now U.S. Patent 5,380,831), which 
is claimed to be a continuation of U.S. Application 07/827,844, 
filed January- -2-8-, 1992, which is claimed to be a continuation-in- 
part of U.S. Application 07/242,482, filed September 9, 1988. 

October 22, 1996 - U.S. Patent 5,567,600 issued from 
Michael J. Adang, Thomas A Rocheleau, Donald J. Merlo and 
Elizabeth E. Murray, U.S. Application 08/369,835, filed June 6, 
1995, including a disclaimer of the terminal portion of the 
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patent extending beyond the expiration date of U.S. Patent 
5,380,831. 

October 22, 1996 - U.S. Patent 5,567,862 issued from 

> t ■ 

Michael J. Adang, Thomas A Rocheleau, Donald J. Merlo and 
Elizabeth E. Murray, U.S. Application 08/369,839, filed June 6, 
1995. 

November 7, 1996 -Interference 103,781 was declared 
essentially as follows (Paper No. 2): 



JUNIOR PARTY APPLICATION 



Named Inventors: 



Kenneth A. Barton and Michael J. Miller 



Application : 



Application 07/827,906, filed 
January 30, 1992 



Title: 



Improved Expression of Genes in Plants 



Assignee : 



None (assignment to Monsanto Company 
recorded October 15, 1996; assignment 
to Monsanto Technology LLC recorded 
June 13, 2001) 



Accorded benefit 
for the purpose of 
priority v jof : 



Application 07/390,561, filed August 7, 
1989 



JUNIOR PARTY APPLICATION 



Named Inventors: 



David A. Fischhoff and Frederick J. 
Perlak 



Application : 



Application 08/434,105, filed May 3, 
1995 



Title: 



Synthetic Plant Genes and Method for 
Preparation 
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Assignee ; 



Accorded benefit 
for the purpose of 
priority of: 



None (assignment to Monsanto Technology 
LLC recorded June 13, 2001) 



Application 07/959,506, filed October 9, 
1992, now U.S. Patent 5,500,365, issued 
March 3, 1996; Application 07/476,661, 
filed February 12, 1990, now abandoned; 
and Application 07/315,355, filed 
February 24, 1989, now abandoned 

SENIOR PARTY PATENT 



Named Inventors: 



Application : 



Title: 



Michael J. Adang, Thomas A. Rocheleau, 
Donald J. Merlo and Elizabeth E. Murray 

Application 08/057,191, filed May 3, 
1993, now U.S. Patent 5,380,831, issued 
January 10, 1995 

Synthetic Insecticidal Crystal Protein 
Gene 



Assignee : 



Accorded benefit 
for the purpose of 
priority of: 



Mycogen Plant Science, Inc. (Paper 
No. 13) 



Applications 07/827,844, filed 
January 28, 1992, now abandoned, 
and Application 07/242,482, filed 
September 9, 1988, now abandoned 

Count 1 



A method of designing a synthetic Bacillus 
thuringiensis gene to be more highly expressed in plants, 
comprising the steps of: 

a) analyzing the coding sequence of a gene derived 
from Bacillus thuringiensis which encodes an insecticidal 
protein toxin, and modifying a portion of said coding 
sequence to yield a modified sequence which contains a 
greater number of codons preferred by the intended plant 
host than did said coding sequence, or 
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b) analyzing the coding sequence of a gene derived 
from Bacillus thurinaiensis which encodes an insecticidal 
protein toxin, and modifying a portion of said coding 
sequence to yield a modified sequence which contains a 
greater number of codons preferred by the intended plant 
host than did said coding sequence and fewer plant 
polyadenylation signals than said coding sequence. 

The claims of the parties which correspond to this 
count are: 

Barton et al.: Claims 1-4, 7, and 15-22 

Fischhoff et al.: Claims 3, 5, and 39-43 
Adang et al.: Claims 1-14. 

December 12, 1996 - An administrative patent judge (APJ) 



entered an Order to Show Cause stating (Paper No. 11, pp. 1-2, 



bridging para . ) : 



In view of the common ownership by Monsanto 
Company of the Barton application and the Fischhoff 
application, the junior party Barton is ordered 
to show cause why judgement should not be entered 
against him within 30 days from* the date of this 
- order. Monsanto Company, as the assignee of both 
Barton and Fischhoff, may name the prior inventor 
in response to this order. Cf. M.P.E.P. 2302. 

January^I-7/- 1997 - The APJ ordered Monsanto Company "to name 
the prior inventor of count 1 . . . . In the event Monsanto 
makes no election, judgment will be entered against junior party 
Barton" (Paper No. 29, p. 3). 

February 3, 1997 - Junior Party Barton et al. (hereafter 
Barton) petitioned the Commissioner under 37 CFR § 1.644(a)(1) 
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to reverse or postpone the AP J' s January 17, 1997 order (Paper 
No. 35) . 

March 26, 1997 - Barton's February 3, 1997, petition was 
denied (Paper No. 38). 

June 19, 1997 - The Board of Patent Appeals and 

Interferences (hereafter Board) entered the following judgment 

(Paper No. 53) : 

Whereas Monsanto, the common assignee of the 
Barton et al. and Fischhoff et al. applications has 
named the party Fischhoff et al. as the prior inventor 
of count 1, pursuant to 37 CFR 1.602(a) and 1.610(e) 
judgement is hereby entered against Barton et al. 
as to the subject matter of count 1. Accordingly, • 
Kenneth A. Barton and Michael J. Miller are not 
entitled to a patent containing Claims 1-4, 7, and 
15-22 corresponding to count 1. The interference 
will continue as Fischhoff et al. v. Adang et al. 

June 27, 1997 - Barton filed notice under 35 U.S.C. §§ 141 
and 142 of appeal to the U.S. Court of Appeals for the Federal 
Circuit from the judgment of the Board of Patent Appeals and 
Interferences entered June 17, 1997 (Paper No. 55) . 

February "T,"" 1998 - The U.S. District Court for the District 
of Delaware entered a judgment ( Mvcoaen Plant Science, Inc. v. 
Monsanto Co. , No. 96-505 (D. Del. Feb. 5, 1998)) in a suit' 
brought by Mycogen Plant Science, Inc., and Agrigenetics Inc. 
against Monsanto Co., DeKalb Genetics Corp., and Delta and Pine 
Land Co. for infringement of two Mycogen patents (Adang et al., 
U.S. Patent 5,567,862, entitled "Synthetic Insecticidal Crystal 
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Protein Gene," which issued October 22, 1996, from U.S. 
Application 08/369,839, filed January 6, 1995, and Adang et al., 
- S . Patent 5,567,600, entitled "Synthetic Insecticidal Crystal 
Protein Gene," which issued October 22, 1996, from U.S. 
Application 08/369,835, filed January 6, 1995). A jury rendered 
a verdict that (1) defendants' products did not literally 
infringe any of the contested claims of Mycogen's patents, and 
(2) all of the contested claims of Mycogen's patents are invalid 
because Monsanto invented the subject matter thereof before the 
priority dates of Mycogen's patents. See the Procedural History 
in Mvcoaen Plant Sci., Inc. v. Monsanto Co. , 243 F.3d 1316, 
1320-1321, 58 USPQ2d 1030, 1033-1034 (Fed. Cir. 2001) (Paper 
No. 125) . 

December 9, 1998 - The Court of Appeals for the Federal 
Circuit reversed the Board's June 19, 1997 judgment and remanded 
(Paper No. 124). Barton v. Adang , 162 F.3d 1140, 49 USPQ2d 1128 
(Fed. Cir. .1S98),( Paper No. 118, Exhibit A). 

September 8, 1999 - The U.S. District Court for the District 
of Delaware entered a revised order (Paper No. 125, Exh. H) and 
opinion ruling on post-trial motions (Paper No. 125, Exh. I) 
( Mvcoaen Plant Sci., Inc. v. Monsanto Co. , 61 F. 2d 199 
(D. Del.' 1999)) which "granted Mycogen's motion for JMOL that 
Monsanto' s processes and resulting products infringed . . . the 
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'600 and '862 patents." Mvcogen Plant Sci., Inc. v. Monsanto 

Co. , 243 F.3d at 1320, 58 USPQ2d at 1034 (Paper No. 146). 

The district court also denied Mycogen's motion for JMOL 
•<■ requesting the court to set aside the jury's finding of 
anticipation due to prior invention by Monsanto. 

The district court granted Monsanto' s motion for 
JMOL holding that the claims of the '600 and '862 patents 
were invalid for lack of enablement pursuant to 35 U.S.C. 
§ 112 

Id. at 1321, 58 USPQ2d at 1034 (Paper No. 146). 

November 10, 1999 - In a suit brought by Mycogen Plant 

Science, Inc. and Agrigenetics Inc. against Monsanto Company for 

infringement of plaintiff's patent (Adang et al., U.S. Patent 

5,380,831, issued January 10, 1995, from U.S. Application 

08/057,191, filed May 3, 1993), the U.S. District Court for the 

Southern District of California entered an order ( Mycogen Plant 

Sci., Inc. v. Monsanto Co. , No. 95-CV-653 (S.D. Cal. Nov. 10, 

1999) (Paper No. 127, Exh. A) which granted defendant's motion for 

summary judgment that Claims 1-12 of Mycogen's '831 patent are 

invalid under "35 U.S.C. § 102(g) and/or § 103 because Monsanto 

invented the subject matter thereof before Mycogen, as determined 

by the U.S. District Court for the District of Delaware in 

Mvcogen Plan-t Sci., Inc. v. Monsanto Co. , 61 F. Supp.2d 199 

(D. Del. 1999), which was affirmed in Mycogen Plant Sci., Inc. v. 

Monsanto Inc. . 243 F.3d 1316, 58 USPQ2d 1030 (Fed. Cir. 2001), 

and denied defendant's motion for summary judgment that the 
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contested claims of Mycogen' s '831 patent are invalid for 
noncompliance with the enablement requirement of the first 

paragraph of 35 U.S.C. § 112 as moot (Paper No. 127, Exh. A). 

v 

January 18, 2000 - U.S. patent 6,015,891 issued from Michael 
J. Adang and Elizabeth E. Murray, U.S. Application 08/705,438, 
filed August 29, 1996, subject to disclaimers of its term 
extending beyond the statutory expiration dates of Adang et al., 
U.S. Patent 5,567,600, issued October 22, 1996, and Adang et al., 
U.S. Patent 5,380,831, issued January 10, 1995. 1 



But for prior issued patents identified in the terminal 
disclaimers entered as Paper No. 23 in U.S. patent 6,015,891 and 
Paper No. 25 in U.S. Patent 6,013,523, both disclaimers read: 

The owner of 100% interest in the instant 
application, Mycogen Plant Science, Inc., hereby 
disclaims, except as provided below, the terminal 
part of the statutory term of any patent granted on 
the instant application, which would extend beyond 
the expiration date of the full statutory term defined 
in 35 U.S.C. 154 to 156 and 173 for U.S. Patent No . . . 
issued^Qjw..-.. . - • 

In making the above disclaimer, the owner does not 
disclaim the terminal part of any patent granted on the 
instant application that would extend to the expiration 
date of the full statutory term as defined in 35 U.S.C. 
154 to 156 and 173 of U.S*. Patent No ... in the event 
that any of the . . . patents: expires for failure to pay 
a maintenance fee, is held unenforceable, is found invalid 
by a court of competent jurisdiction, is statutorily 
disclaimed in whole or terminally disclaimed under 37 CFR 
1.321, has all claims cancelled by a reexamination 
certificate, is reissued, or is in any manner terminated 
prior to the expiration of its full statutory term. 
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January 18, 2000 - U.S. patent 6,013,523 issued from Michael 
J. Adang and Elizabeth E. Murray, U.S. Application 08/704,966, 
filed August 29, 1996, subject to a disclaimer of its term 
extending beyond the statutory expiration date of Adang et al., 
U.S. Patent 5,567,862, issued October 22, 1996. 

March 12, 2001 - On appeal from the decision of the U.S. 

District Court for the District of Delaware in Mvcoaen Plant 

Sci., Inc. v. Monsanto Co. , 61 F. Supp.2d 199 (D. Del. 1999), the 

U.S. Court of Appeals for the Federal Circuit: 

. . . affirm [ed] the verdict of noninfringement based 
on patent invalidity due to prior invention pursuant 
to 35 U.S.C. § 102(g). This makes it unnecessary to 
address the finding of lack of enablement pursuant to 
35 U.S.C. § 112. 

Mvcoaen Plant Sci., Inc. v. Monsanto Co. , 243 F.3d at 1320, 

58 USPQ2d at 1033 (Paper No. 146). 

May 30, 2001 - On appeal from the decision of the U.S. 

District Court for the Southern District of California in Mvcoaen 

Plant Sci., Inc. v. Monsanto Co. , No. 95-CV-653 (S.D. Cal. 

Nov. 10, 1999) (Paper No. 127, Exh. A), the U.S. Court of Appeals 

for the Federal Circuit af f irmed-in-part , reversed-in-part , and 

remanded. Mvcoaen Plant Sci., Inc. v. Monsanto Co. , 252 F.3d 

1306, 1306, 58 USPQ2d 1891, 1892-1893 (Fed. Cir. 2001). The 

Federal Circuit concluded at 1309, 58 USPQ2d at 1893, that: 

. . . the district court improperly resolved disputed 
questions of material fact pertaining to the issue of 
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prior invention, and we therefore reverse the court's 
ruling on summary judgment that the '831 patent is 
invalid under 35 U.S.C. § 102(g). We decline to affirm 
the summary judgment of invalidity on the alternative 
ground of non-enablement , as urged by Monsanto, but 
v leave to the district court the task of determining in 
the first instance whether there is a genuine issue of 
material fact as to enablement based on its assessment 
of the evidence presented to it in the summary judgment 
proceeding . 

Id^ at 1310, 58 USPQ2d at 1894, the Federal Circuit explained: 

We agree with the district court that collateral 
estoppel requires the court to conclude that Monsanto 
reduced the invention [claimed in the Mycogen' s '831 
patent] to practice before Mycogen, and that collateral 
estoppel does not resolve the question whether Mycogen 
was the first to conceive and then was diligent during 
the critical period. On the merits of the' summary 
judgment question, however, we do not agree that 
Monsanto has met its burden of showing that there 
are no issues of material fact regarding whether 
Mycogen was the first to conceive the invention and 
then diligently reduce it to practice. 

2 . Outstanding requests and motions 

A. Adanq's Preliminary Motion 1 (Paper No. 45) 

By Adang' s Preliminary Motion 1 (Paper No. 45), Adang moves 

under 37 CFR § 1.633(c)(1) to redefine the interfering 

subject matter by substituting Proposed Substitute Count 2 

for existing Count 1. 

Fischhoff's Opposition (Paper No. 58) 
Adanq's Reply (Paper No. 96) 

B. Adanq's Preliminary Motion 2 (Paper No. 46) 

By Adang's Preliminary Motion 2 (Paper No. 46), Adang moves 
under 37 CFR § 1.633(f) for benefit of the January 28, 1992, 
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filing date of U.S. Application 07/827,844, and the September 9, 
1988 filing date of U.S. Application 07/242,482, for Proposed 
Substitute Count 2 of Adang' s Preliminary Motion 1 (Paper 
No. 45) . 

Fischhof f ' s response (Paper No. 59) 
Adang' s Reply (Paper No. 97) 

C. Adang' s Contingent Preliminary Motion 3 (Paper No. 47) 

By Adang' s Contingent Preliminary Motion 3 (Paper No. 47), 

Adang moves under 37 CFR § 1.633(a) for judgment that Claims 3, 

5, and 39-43 of Fischhoff's involved U.S. Application 08/434,105, 

filed May 3, 1995, all Fischhoff's claims designated as 

corresponding to the interference count, are unpatentable under 

35 U.S.C. § 102(g) over at least one claim of Barton's U.S. 

Application 07/827,906, filed January 30, 1992, designated as 

corresponding to the count, or under 35 U.S.C. § 103 in view 

of prior art including at least one claim of Barton's U.S. 

Application 07/827,906, filed January 30, 1992 , ■ designated as 

corresponding "to the count. The motion presumes that party 

Barton has been determined to be, or Monsanto Technology LLC, 

the assignee of both of Fischhoff's and Barton's involved 

applications, has elected to designate Barton as, first to invent 

the subject matter defined by current Count 1 as between parties 

Fischhoff and Barton (Paper No. 47, pp. 1-2) . In conjunction 

therewith, Adang contingently moves for permission to seek 
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deposition and documentary discovery relevant to the 

determination and/or election of Barton as first to invent the 

^ubject matter defined by the count as between Fischhoff and 

Barton (Paper No. 47, p. 2). 

Fischhoff s Opposition (Paper No. 62) 
Adang' s Reply (Paper No. 98) 

D. Fischhoff s First 37 CFR 1.642 Request (Paper No. 78) 
Fischhoff requests that the APJ exercise his discretion 

under 37 CFR § 1.642 to add commonly assigned Adang et al., 
U.S. Patent 5,567,600 (Fischhoff Exhibit 37 (FX 37)), issued 
October 22, 1996, to this interference, designate all twenty- 
four claims thereof as corresponding to the count, and set an 
additional preliminary motion period for the parties to file 
motions relative to the newly added patent (Paper No. 78, p. 1) . 

Adanq's Opposition (Paper No. 63) 
Fischhoff ' s Reply (Paper No. 102) 

E. Fischhoff 's Second 37 CFR 1.642 Request (Paper No-. 79) 
Fischhoff— requests that the APJ exercise his discretion 

under 37 CFR § 1.642 to add commonly assigned Adang et al., 
U.S. Patent 5,567,862 (FX 3), issued October 22, 1996, to this 
interference, designate all twenty-four claims thereof as 
corresponding to the count, and set an additional preliminary 
motion period for the parties to file motions relative to the 
newly added patent (Paper No. 79, p. 1) . 
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Adang's Opposition (Paper No. 64) 
Fischhoff's Reply (Paper No. 103) 

F. Fischhoff's Preliminary Motion 3 (Paper No. 80) 
By Fischhoff's Preliminary Motion 3 (Paper No. 80), 

Fischhoff moves under 37 CFR § 1.633(a) for judgment that 

Claims 1-12 of Adang's U.S. Patent 5,380,831 (FX 11), issued 

January 10, 1995, are unpatentable under 35 U.S.C. § 112, second 

paragraph (Paper No. 80, p. 1) . 

Adang's Opposition (Paper No. 65) 
Fischhoff's Reply (Paper No. 104) 

G. Fischhoff's Preliminary Motion 4 (Paper No. 81) 
By Fischhoff's Preliminary Motion 4 (Paper No. 81), 

Fischhoff moves under 37 CFR § 1.633(c) (1) to redefine the 

subject matter of, this interference by substituting any one of 

Fischhoff's Proposed Counts 2, 3 and 4 for original Count 1 

(Paper No. 81, p. 1) . 

Adang's Opposition (Paper No. 69) 
Fischhoff's Reolv (Paper No. 108T 

H. Fischhoff's Preliminary Motion 5 (Paper No. 82) 
By Fischhoff's Preliminary Motion 5 (Paper No. 82), 

Fischhoff moves under 37 CFR § 1.633(a) for judgment that 
Claims 1-12 of Adang's U.S. Patent 5,380,831 (FX 11), issued 
January 10, 1995, are unpatentable under 35 U.S.C. § 112, first 
paragraph (enablement requirement) (Paper N. 82, p. 1) . 
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Adanq's Opposition (Paper No. 66) 
Fischhoff's Reply (Paper No. 105) 

I. Adang's 37 CFR § 635 Motion For 

Order Implementing the CAFC Decision 
■< ' In Barton v. Adang (Paper .No". 116) 

Adang moves under 37 CFR § 1.635 (Paper No. 116) for an 

order implementing the decision in Barton v. Adang , 162 F.3d 

1140, 49 USPQ2d 1128 (Fed. Cir. 1998) (Paper No. 118, Exhibit A). 

Joint Opposition Fischhoff and Barton (Paper No. 119) 
- Adang's Reply (Paper No. 121) 

J. Fischhoff's Preliminary Motion 6 (Paper No. 83) 

By Fischhoff's Preliminary Motion 6 (Paper No. 83), 

Fischhoff moves under 37 CFR § 1.633(c) (2) to redefine the 

subject matter of the interference by adding proposed Claims 44 

and 45 to Fischhoff's involved U.S. Application 08/434,105, filed 

May 3, 1995 (Paper No. 84), and designating the new claims as. 

corresponding to the count (Paper 83, pa. 1) . 

K. Fischhoff's Preliminary Motion 7 (Paper No. 85) 

By Fischhoff's Preliminary Motion 7 (Paper No. 85), 

Fischhoff moves under 37 CFR § 1.633(a) for judgment that 

Claims 1-12 of Adang's U.S. Patent 5,380,831 (FX 11), issued 

January 10, 1995, are unpatentable under 35 U.S.C. § 102 or 

§ 103 (Paper No. 85, p. 1) . 

Adang's Opposition (Paper No. 68) 
Fischhoff's Reolv (Paper No. 107) 
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L. Fischhoff's Preliminary Motion 8 (Paper No. 86) 

By Fischhoff's Preliminary Motion 8 (Paper No. 86), 

Fischhoff moves under 37 CFR § 1.633(f) to be accorded benefit of 

the October 9, 1992, filing date of Fischhoff's U.S. Application 

07/959,506; the February 12, 1990, filing date of Fischhoff's 

U.S. Application 07/476,661; and the February 24, 1989, filing 

date of U.S. Application 07/315,355, for Fischhoff's Proposed 

Counts 2, 3, and 4 (Fischhoff's Preliminary Motion 4 , -Paper 

No. 81) (Paper No. 86, p. 1) . 

M. Fischhoff's Preliminary Motion 9 (Paper No. 87) 

By Fischhoff's Preliminary Motion 9 (Paper No. 87), 

Fischhoff moves under 37 CFR § 1.633(a) for judgment that 

Claims 1-12 of Adang et al., U.S. Patent 5,380,831, issued 

January 10, 1995, are unpatentable under 35 U.S.C. § 112, first 

paragraph (written description requirement) (Paper No. 87, p. 1) . 

Adang' s Opposition (Paper No. 67) 
Fischhoff's Reply (Paper No. 106) 

N. Fischhoff's Preliminary Motion 10 (Paper No. 88) 

By Fischhoff's Preliminary Motion No. 10, Fischhoff moves 

under 37 CFR § 1.633(c) (4) to redefine the subject matter of the 

interference by designating (1) Claims 41-43 of Fischhoff's 

involved U.S. Application 08/434,105, filed May 3, 1995, and 

(2) Claims 13-14 of Adang' s U.S. Patent 5,380,831 (FX 11), 
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issued January 10, 1995, as not corresponding to the count (Paper 
No. 88, p. 1) . 

Adang's Opposition (Paper No. 70) 
4' Fischhoff's Reolv (Paper No." 109) 

0. Fischhoff's Preliminary Motion 11 (Paper No. 89) 

By Fischhoff's Preliminary Motion 11 (Paper No. 89), 

Fischhoff moves under 37 CFR § 1.633(c)(2), contingent on 

granting Fischhoff's Second 37 CFR § 642 Request (Paper No. 79), 

to redefine the subject matter of the interference by adding 

proposed Claims 4 6 to Fischhoff's involved U.S. Application 

08/434,105, filed May 3, 1995 (Paper No. 90), and designating 

the new claim as corresponding to the count (Paper No. 89, p. 1) . 

P. Fischhoff's Preliminary Motion 12 (Paper No. 60) 

By Fischhoff's Preliminary Motion 12 (Paper No. 60), 

Fischhoff moves, contingent upon the granting of Adang's 

Preliminary Motion 1 (Paper No. 45), for benefit under 37 CFR 

§ 1.633(f) of the October 9, 1992, filing date of Fischhoff's 

U.S. Application 07/959,506, the February 12, 1990, filing date 

of Fischhoff's U.S. Applicat ion 07/476,661, and the February 25, 

1989, filing date of Fischhoff's U.S. Application 07/315,355, for 

subject matter defined by Adang's Proposed Substitute Count 2. 

Q. Fischhoff's 37 CFR § 1.641(a) Request (Paper No. 110) 

Fischhoff requests that the APJ exercise his discretion 

under 37 CFR § 1.641(a), notify the parties that Claims 1-12 of 
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Adang's U.S. Patent 5,380,831, issued January 10, 1995, appear to 

be unpatentable under 35 U.S.C. § 112, first paragraph (best mode 

requirement) , and set a time period for the parties to take 

testimony and present related evidence and argument (Paper 

No. 110, p. 2, para. I) . 

Adang's Response (Paper No. Ill) 

R. Fischhoff's 37 CFR § 1.635 Motion 
For Temporary Stay of Interference 
Under 37 CFR § 1 . 645 (d) ( Paper No. 118) ... . 

Fischhoff moves under 37 CFR § 1.635 for an order 

temporarily staying the interference proceedings under 37 CFR 

§ 1.645(d) in anticipation of an impending decision of the 

U.S. District Court for the Southern District of California 

"on a motion for summary judgment that the claims of . . . 

Adang's . . . U.S. Patent No. 5,380,831 are invalid under 

35 U.S.C. § 102(g) because of prior invention by . . . 

Fischhoff . . ." (Paper No, 118, p. 2, para. 1). 

Adang's Opposition (Paper No. 120) 

Fischhoff's Reply (Paper No. 122) 

Fischhoff's Supplement To Motion (Paper No. 125) 

Adang's Opposition To Supplement (Paper No. 126) 

S. Fischhoff's 37 CFR § 1.635 Motion 
For Temporary Stay of Interference 
Under 37 CFR § 1.645(d) (Paper No. 127) 

Fischhoff moves under 37 CFR § 1.635 for an order 

temporarily staying the interference proceedings under 37 CFR 

§ 1.645(d) pending a decision on appeal to the Federal Circuit of 

-23- 



Interference 103,781 



a decision of the U.S. District Court for the Southern District 

of California granting Monsanto' s motion for summary judgment 

that claims of Adang's U.S. Patent 5,380,831 are invalid under 

35 U.S.C. § 102(g) over the prior invention of Fischhoff (Paper 

No. 127, p. 2, para. I) . 

Adang's Opposition (Paper No. 128) 
Fischhoff' s Reolv (Paper No. 132) 

3 . Decisions on Outstanding Requests and Motions 

"A party filing a motion has the burden of proof to show 

that it is entitled to the relief sought in the motion. 37 CFR 

§ 1. 637 (a) . 

A. Adang's Preliminary Motion 1 (Paper No. 45) 
Adang moves under 37 CFR § 1.633(c) (1) to redefine the 
interfering subject matter by replacing existing Count 1 with 
Proposed Substitute Count 2. Existing Count 1 reads (Paper 
No. 45, p. 3; Paper No. 2) : 

Count 1 

A'^methbd of designing a synthetic Bacillus 
thuringiensis gene to be more highly expressed in plants, 
comprising the steps of: 

a) analyzing the coding sequence of a gene derived 
from Bacillus thuringiensis which encodes an insecticidal 
protein toxin, and modifying a portion of said coding 
sequence to yield a modified sequence which contains a 
greater number of codons preferred by the intended plant 
host than did said coding sequence, or 



b) analyzing the coding sequence of a gene derived 
from Bacillus thuringiensis which encodes an insecticidal 
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protein toxin, and modifying a portion of said coding 
sequence to yield a modified sequence which contains a 
greater number of codons preferred by the intended plant 
host than did said coding sequence and fewer plant 
jj . polyadenylation signals than said coding sequence. 

Proposed Substitute Count 2 reads (Paper No. 45, p. 2) : 

Proposed Substitute Count 2 

A method of designing a synthetic Bacillus 
thurinqiensis gene to be more highly expressed 
in plants, comprising the steps of: 

analyzing the coding sequence of a" gene derived from 
a Bacillus thurinqiensis which encodes an insecticidal 
protein toxin, and 

modifying a portion of said coding sequence to yield 
a modified sequence which contains a greater number of 
codons preferred by the intended plant host than did 
said coding sequence; 

or 

A method of designing a synthetic Bacillus 
thurinqiensis gene to be more highly expressed 
in plants, comprising the steps of: 

analyzing the coding sequence of a gene derived from 
a Bacillus thurinqiensis which encodes an insecticidal 
protein toxin, 

modifying a portion of said coding sequence to yield 
a modified -sequence which contains a greater number of 
codons preferred by the intended plant host than did 
said coding sequence, and 

further comprising the step of modifying a portion 
of said coding sequence to eliminate plant 
polyadenylation. signals. 

Adang submits Proposed Substitute Count 2 to eliminate the 

possibility that "the analyzing step in part b) of the current 
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count could be incorrectly interpreted as analyzing only for the 
presence of polyadenylation signals" (Paper No. 45, p. 8, last 
complete sentence) . Adang explains (Paper No. 45, p. 8, second 
full para - ) : 

Unfortunately, part b) of the current count, which 
uses the- language of Fischhoff's claim 39, recites both 
the claim 1 step of "modifying" to yield a modified 
sequence which contains a greater number of codons 
preferred by the intended plant host and the claim 4 
step of "modifying" to eliminate plant polyadenylation 
sequences as a single step. As a result, the single 
step of part b) of the current count could be incorrectly 
interpreted as analyzing for any characteristic that 
would give the combined result of having a greater 
number of codons preferred by the intended plant host 
and eliminating plant polyadenylation signals. 

The steps of the current count are not open to the 

interpretation Adang contemplates for two reasons. First, the 

alternative process defined by part b) of the current count more 

particularly comprises the step of "modifying a portion of said 

coding sequence to yield a modified sequence which contains a 

greater number of codons preferred by the intended plant host 

than did sal 5 coding sequence and fewer plant polyadenylation 

signals than said coding sequence " (emphasis added) . When all 

the language of the current count is considered, the analyzing 

step of the process comprising part b) must comprise separate 

analyses of the coding sequence of a Bt gene both (1) for plant 

codon preferences and (2) for polyadenylation signals, and the 

modifying step of the process comprising part b) must comprise 
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separate modifications of a portion of the coding sequence to 

yield a modified sequence which contains (1) a greater number of 

codons preferred by the intended plant host than did said coding 

sequence, and (2) fewer polyadenylation signals than said coding 

sequence. Relative to the coding sequence of the Bt gene 

analyzed, reduction in the number of polyadenylation signals 

present in the coding sequence cannot simultaneously result in a 

greater number of codons preferred by the intended plant host 

relative to the number of codons preferred by the intended plant 

host present in the originally analyzed Bt coding sequence. 

The above interpretation of the current count is consistent 

with the interpretations of substantially identical, language in 

the claims of Adang's U.S. Patents 5,567,600 and 5,567,862, both 

issued October 22, 1995, and the parent thereof, Adang's involved 

U.S. Patent 5,380,831, issued January 10, 1995, by the U.S. 

District Court for the District of Delaware. See the district 

court's claim interpretation in Mycogen Plant Scr. , Inc. v. 

Monsanto Co. , 61 F. Supp.2d 199, 215 (D. Del. 1999), affirmed on 

review by the Federal Circuit in Mvcoaen Plant Sci., Inc. v, 

Monsanto Co. , 243 F.3d 1316, 1326-1327, 58 USPQ2d 1030, 1038-1039 

(Fed. Cir. 2001) : 

Mycogen contests a portion of the district court's 
claim construction. Specifically, Mycogen contends that 
the district court's definition of the "greater number 
of codons preferred" language in independent claims 1, 
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2, 13 and 14 of the '600 patent is erroneous. 
However, the claim construction issue here relates 
to both the '600 and the '862 patent, as well as 
the original '831 parent patent, as all three patents 
contain claims that use the language disputed herein. 
Claim 1 of the '600 patent is representative, and it 
reads as follows: 

1. A method of designing a synthetic 
Bacillus thuringiensis gene to be more highly 
expressed in plants, comprising the steps of: 

(a) analyzing the coding sequence of a gene 
derived from a Bacillus thuringiensis which 
encodes a pesticidal protein toxin; . _ 

(b) modifying a portion of said coding sequence 
to yield a modified sequence which contains a 
greater number of codons preferred by the 
intended plant host than did said coding 
sequence prior to modification, said modification 
comprising reducing the number codons having 

CG in the codon positions II and III in a 
region between plant polyadenylation signals 
in said coding sequence; 

(c) inserting said modified sequence into the 
genome of a plant cell; and 

(d) maintaining said plant cell under conditions 
suitable to allow replication of said plant 
cell to produce additional plant cells having 
said^ modified sequence in the genome of said 
a'dSitibnal plant cells, wherein said synthetic 
Bacillus thuringiensis gene is expressed to 
produce a pesticidal protein toxin. 

'600 patent, col. 31, lines 37-57 (emphasis added). 

In Mycogen , the district court held that: 

The phrase "greater number of codons preferred, " 
is satisfied where the newly-created synthetic 
gene has a higher number of those codons whose 
frequency in the native Bt gene was lower than 
their frequency in the intended plant host, 



) 
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and where the synthetic gene has an overall 
distribution of codon usage that is closer to 
that of the intended plant host. 

61 F.Supp.2d at 215 

Ultimately, the Federal Circuit said, Mvcoaen Plant Sci., 

Inc. v. Monsanto Co. , 243 F.3d at 1329-1330, 58 USPQ2d at 1041: 

[T] runcation could not satisfy the district court's 
definition of the claim 1 "greater number" definition, 
because it would not result in a higher number of 
those codons whose frequency in the native Bt gene 
was lower than their frequency in the intended plant 
host 

. . . Thus, the district court's claim construction 
regarding the "greater number of codons preferred" 
limitation was correct. 

The parties present different views of the clarity of the 
language of current Count 1 relative to the clarity of the 
language of Proposed Substitute Count 2 and the scope of the 
subject matter encompassed by each count. However, the issues 
raised are more appropriately considered in the context of the 
parties' motions for judgment' on the patentability of their 
claims designated as corresponding to the interference count 
under 35 U.S.C. § 112, second paragraph. Suffice it to say, the 
reasons Adang presents for redefining the interfering subject 
matter by replacing current Count 1 with Proposed Substitute 
Count 2 are based on an interpretation of the language of current 
Count 1 which the Federal Circuit has determined is incorrect as 
a matter of law. 
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Adang has not shown that it is entitled to the relief 
requested. Accordingly, Adang' s Preliminary Motion 1 (Paper 
tyo. 45) is DENIED . 2 

B. Adang 7 s Preliminary Motion 2 (Paper No. 46) 

Adang' s Preliminary Motion 2 (Paper No. 46) under 37 CFR 
§ 1.633(f) to be accorded benefit of the filing dates of U.S. 
Applications 07/827,844 and 07/242,482 for subject matter of 
Proposed Substitute Count 2 is contingent upon Adang' s 
Preliminary Motion 1 (Paper No. 45) under 37 CFR § 1.633(c) (1) to 
redefine the subject matter of this interference by replacing 
existing Count 1 with Proposed Substitute Count 2 being granted. 
Since Adang' s Preliminary Motion 1 has been denied, Adang' s 
Preliminary Motion 2 is DISMISSED . 

C. Adang' s Contingent Preliminary Motion 3 (Paper No. 47) 
Barton v. Adang , 162 F.3d 1140, 1146, 49 USPQ2d 1128, 1134 

(Fed. Cir. 1998) (Paper No. 118, Exhibit A) held that Monsanto 
Company hacL.shown good cause why (1) it should not have been 

2 Given that the Delaware district court interpreted the 
phrase "greater number of codons preferred" appearing in many, 
but not all, of the claims of Mycogen's U.S. Patents 5,567,600 
and 5,567,862 (the phrase also appears in Claim 1 of Mycogen's 
U.S. Patent 5,380,831 and Count 1 of this interference recites 
the language of Claim 1 of U.S. Patent 5,380,831), the issue 
yet to be considered is whether the subject matter of the 
interference count should be redefined utilizing the language 
of Claim 11 of U.S. Patent 5,380,831. This issue is raised 
by Fischhoff's Preliminary Motion 4 (Paper No. 81). 
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forced to elect the prior inventor of Count 1 as between 

Fischhoff and Barton before the subject matter defined by the 

interference count is finalized, and (2) Barton should not have 

been prematurely dismissed as a party to this interference: 

[A]t the time that Monsanto was forced to make an 
election between the Barton et al. and the Fischhoff 
et al. applications, it was not clear what the content 
of the final count would be or what proofs on dates 
of conception and reduction to practice Adang et al. 
would seek to establish. Also, if the final count 
as decided by the Board excluded subject matter 
disclosed in Barton et al., but not Fischhoff et al., 
Monsanto loses arguably patentable subject matter 
by early dismissal of Barton et al. 

At this stage of the proceedings, Monsanto could 
not determine which application, either Barton et al. 
or Fischhoff et al., would be the best evidence to 
establish priority. We hold that Monsanto has shown 
"good cause" to continue the interference on both its 
applications until the preliminary motions, to finalize 
the count are decided by the Board and discovery is 
complete . 

Because Monsanto Company was improperly ordered to elect the 
prior inventor of Count 1 as between Barton and Fischhoff before 
the interf er.en-ce count was finalized, neither Fischhoff s nor 
Barton' s claims designated- as corresponding to Count 1 may be 
rejected under 35 U.S.C. § 102 (g) /103 based on a forced election 
of the prior inventor of Count 1 by Monsanto Company at a time 
during the proceedings when "Monsanto could not determine which 
application, either Barton et al. or Fischhoff et al., would be 
the best evidence to establish priority." Id. Moreover, even if 
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Monsanto' s forced election of the prior inventor of Count 1 as 
between Barton and Fischhoff could be basis for rejecting the 

nonelected party's claims under 35 U.S.C. § 102(g)/103, 

> t ■ 

Monsanto' s election of Fischhoff as the prior inventor of Count 1 

most certainly does not support Adang' s Contingent Preliminary 

Motion 3 under 37 CFR § 1.633(a) for a judgment that Fischhoff s 

claims designated as corresponding to Count 1 are unpatentable 

under 35 U.S.C. § 102 (g) /103 over Barton's claims designated 

as corresponding to Count 1. 

Because the Federal Circuit directed this interference 

proceeding to continue with both Fischhoff and Barton as parties 

"until the preliminary motions to finalize the count are decided 

by the Board and discovery is complete," Barton v. Adang , 

162 F.3d at 1146, 49 USPQ2d at 1134, the discovery Adang requests 

is here, and is generally, inappropriate before the preliminary 

motion stage of the interference proceeding has been completed. 

Accordingly, Adang' s Contingent Preliminary Motion 3 and the 

order for discovery Adang requests (Paper No. 47) are DENIED . 

D. Fischhoff s First and Second 37 CFR 

§ 1.642 Requests (Paper Nos. 78 & 79) 

Fischhoff (Monsanto) asks the APJ to exercise his discretion 

under 37 CFR § 1.642 to (1) add Adang' s (Mycogen's) U.S. Patent 

5,567,600 (FX 37) and U.S. Patent 5,567,862 (FX 3), both issued 

October 22, 1996, to this interference, (2) designate all claims 
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of each patent as corresponding to the count, and (3) set an 
additional preliminary motion period for the parties to file 
motions relative to the newly added patents (Paper No. 78, p. 1; 
Paper No. 79, p. 1). However, on September 8, 1999, the U.S. 
District Court for the District of Delaware held that the claims 
of both Mycogen's U.S. Patent 5,567,600 (FX 37) and Mycogen's 
U.S. Patent 5,567,862 (FX 3) are invalid as unpatentable 

(1) under 35 U.S.C. § 102 (g) /103 due to prior invention of the 
subject matter thereof by Monsanto, and (2) under 35 U.S.C. 
§ 112, first paragraph, as based on a nonenabling disclosure. 
Mycogen Plant Science, Inc. v. Monsanto Co. , 61 F. Supp.2d 199 

(D. Del. 1999). The district court's decision that the claims of 
Mycogen's U.S. Patents 5,567,600 (FX 37) and 5,567,862 (FX 3) are 
invalid as unpatentable under 35 U.S.C. § 102(g)/103 was affirmed 
on appeal to the Federal Circuit. Mycoaen Plant Sci., Inc. v. 
Monsanto Co. , 243 F.3d 1316, 1320, 58 USPQ2d 1030, 1034 (Fed. 
Cir. 2001) paper- No. 146) . 

The parties to this interference have not "discussed the 
Delaware district court's conclusions that the claims of 
Mycogen's U.S. Patent 5,567,600 (FX 37) and the claims of 
Mycogen's U.S. Patent 5,567,862 (FX 3) are invalid as 
unpatentable under 35 U.S.C. § 102(g)/103 because of prior 
invention by Monsanto or as unpatentable under 35 U.S.C. § 112, 
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first paragraph (nonenablement ) . Mycogen has neither shown nor 
argued that the Delaware district court's conclusions are 
^ncorrect. Moreover, the Federal Circuit affirmed the Delaware 
district court's holding that the claims of Mycogen' s patents are 
invalid under 35 U.S.C. § 102(g) over Monsanto' s prior invention. 
Accordingly, Fischhoff's requests to redefine the interfering 
subject matter by adding Mycogen' s U.S. Patents 5,567,600 (FX 37) 
and 5,567,862 (FX 3) to this interference and designating the 
claims of those patents as corresponding to the count are 
DISMISSED . 

There is a related matter. On January 18, 2000, U.S. Patent 
6,015,891 (Appendix A) issued from Michael J. Adang and Elizabeth 
E. Murray, U.S. Application 08/705,438, filed August 29, 1996, 
including a terminal disclaimer of its term which extends beyond 
the statutory expiration dates of Mycogen' s U.S. Patent 5,567,600 
and U.S. Patent 5,380,831, issued January 10, 1995, and U.S. 
Patent 6, 013 f -&~23 (Appendix B) issued from Michael J. Adang and 
Elizabeth E. Murray, U.S. Application 08/704,966, filed 
August 29, 1996, including a terminal disclaimer of its term 
which extends beyond the statutory expiration date of Mycogen' s 
U.S. Patent 5,567,862. To completely resolve all patentability 
issues concerning what appears to be claims drawn to the "same 
patentable invention" as the claims presently designated as 
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corresponding to the count and priority of invention for the 
subject matter defined by the count as between the parties to 
this interference, a comparison of the representative method and 
gene claims presented in Mycogen's U.S. Patents 5,380,831, 
5,567,600, 5,567,862, 6,013,523, and 6,015,891, is in order. 
Claims 1 and 11 of Mycogen's U.S. Patent 5,380,831 read: 

I. A method of designing a synthetic Bacillus 
thuringiensis gene to be more highly expressed in plants, 

comprising the steps of: 

analyzing the coding sequence of a gene derived 
from a Bacillus thuringiensis which encodes an insecticidal 
protein toxin, and 

modifying a portion of said coding sequence to yield 
a modified sequence which contains a greater number of 
codons preferred by the intended plant host than did said 
coding sequence. 

II. A method of designing a synthetic Bacillus 
thuringiensis gene to be more highly expressed in plants, 
comprising the steps of: 

analyzing the coding sequence of a gene derived 
from a Bacillus thuringiensis which encodes an insecticidal 
protein toxin, and 

mocTifying a portion of said coding sequence to yield 
a modified sequence which has a frequency of codon usage 
which more closely resembles the frequency of codon usage 
of the plant in which it is to be expressed. 

Claim 1 of Mycogen's U.S. Patent 6,013,523 reads: 

1. A descendant plant cell comprising a pesticidal 
protein toxin encoded by a synthetic Bacillus thuringiensis 
( B. t ) gene, said cell produced by the process of: 

selecting a B . t . pesticidal protein toxin desired 
to be expressed in a plant cell; 
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obtaining a table indicating codon usage bias for 
a gene or genes more highly expressed in a plant cell 
than a native B . t . gene; . 

using said table to design a modified coding 
sequence which encodes said protein toxin, whereby said 
modified coding sequence has a frequency of codon usage 
that more closely resembles the frequency of codon usage 
of the plant in which it is to be expressed than did the 
native B. t . coding sequence having at least about 10% of 
the nucleotides changed as compared to the native B. t . 
coding sequence; 

obtaining a synthetic B . t . gene comprising a coding 
region comprising said modified coding sequence wherein 
said coding region is under the control of a plant- 
expressible promoters- 
introducing said synthetic B. t . gene into a plant 

cell; 

culturing said cell to obtain descendant plant cells 
or plants comprising descendant plant cells, said descendant 
plant cells comprising said synthetic B. t . gene; and 

establishing that said synthetic B. t . gene is 
expressed in said descendant plant cells. 

Claims 1 and 4 of Mycogen's U.S. Patent 6,015,891: 

1. A synthetic Bacillus thuringiensis ( B. t . ) gene 
which is expressed in descendant plant cells and encodes 
a pesticidal protein toxin, wherein said synthetic B. t . 
gene 13^'pr oduced by the process of: 

selecting a B . t . pesticidal protein toxin desired 
to be expressed in a plant cell; 

obtaining a table indicating codon usage bias for 
a gene or genes more highly expressed in a plant cell 
than a native B. t . gene; 

using said table to design a modified coding 
sequence which encodes said protein toxin, whereby said 
modified coding sequence has a frequency of codon usage 
that more closely resembles the frequency of codon usage 
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of the plant cell in which it is to be expressed than 
did the native B. t . coding sequence encoding said protein 
toxin, said modified coding sequence having at least 
about 10% of the nucleotides changed as compared to the 
native B . t . coding sequence; 

obtaining a synthetic B . t . gene comprising a coding 
region comprising said modified coding sequence wherein 
said coding region is under the control of a plant- 
expressible promoter; 

introducing said synthetic B. t . gene into a plant 

cell; 

culturing said cell to obtain descendant plant cells 
or plants comprising descendant plant cells, said descendant 
plant cells comprising said synthetic B. t . gene; and 

establishing that said synthetic B . t . gene is 
expressed in said descendant plant cells. 

4. A method of designing a synthetic Bacillus 
thuringiensis ( B.. t ) gene which is expressed in 
descendant plant cells, comprising the steps of: 

selecting a B . t . pesticidal protein toxin desired 
to be expressed in a plant cell; 

obtaining a table indicating codon usage bias for 
a gene or genes more highly expressed in a plant cell 
than a native B. t . gene; 

using said table to design a' modified coding 
sequence' which encodes said protein toxin, whereby said 
modified coding sequence has a frequency of codon usage 
that more closely resembles the frequency of codon usage 
of the plant cell in which it is to be expressed than 
did the native B . t . coding sequence encoding said protein 
toxin, said modified coding sequence having at least 
about 10% of the nucleotides changed as compared to the 
native B . t . coding sequencer- 
obtaining a synthetic B . t . gene comprising a coding 
region comprising said modified coding sequence wherein 
said coding region is under the control of a plant- 
expressible promoter; 
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introducing said synthetic B . t . gene into a plant 

cell; 

culturing said cell to obtain descendant plant 
cells, said descendant plant cells comprising said 
4 synthetic B. t . gene; and 

establishing that said synthetic B. t. gene is 
expressed in said descendant plant cells. 

Claims 1, 2, 7, 8, 13, 14, 19 and 20 of Mycogen' s U.S. 

Patent 5,567,600 (FX 37), which were determined by the Delaware 

district court to be invalid, read: 

1. A method of designing a synthetic Bacillus 
thuringiensis gene to be more highly expressed in plants, 
comprising the steps of: 

(a) analyzing the coding sequence of a gene derived 
from a Bacillus thuringiensis which encodes a pesticidal 
protein toxin; 

(b) modifying a portion of said coding sequence 
to yield a modified sequence which contains a greater 
number of codons preferred by the intended plant host 
than did said coding sequence prior to modification, said 
modification comprising reducing the number codons having 
CG in codon positions II and III in a region between plant 
polyadenylation signals in said coding sequence; 

(c) ^inserting said modified sequence into the genome 
of a pTarft cell; and 

(d) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
sequence in the genome of said additional plant cells, 
wherein said synthetic Bacillus thuringiensis gene is 
expressed to produce a pesticidal protein toxin. 

2. A DNA coding sequence produced by: 

(a) analyzing the coding sequence of a gene derived 
from a Bacillus thuringiensis which encodes a pesticidal 
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protein toxin; and 

(b) modifying a portion of said coding sequence 
to yield a modified sequence which contains a greater 
number of codons preferred by the intended plant host 
than did said coding sequence, .said modification comprising 
reducing the number codons having CG in codon positions II 
and III in a region between plant polyadenylation signals 
in said coding sequence. 

7. A method of designing a synthesis Bacillus 
thuringiensis gene to be more highly expressed in plants, 
comprising the steps of: 

(a) analyzing the coding sequence of a gene derived 
from a Bacillus thuringiensis which encodes a pesticidal 
protein toxin; 

(b) modifying a portion of said coding sequence 

to yield a modified sequence which has a frequency of codon 
usage which more closely resembles the frequency of codon 
usage of the plant in which it is to be expressed than 
did said coding sequence prior to modification, said 
modification comprising reducing the number codons having 
CG in codon positions II and III in a region between plant 
polyadenylation signals in said coding sequence; 

(c) inserting said modified sequence into the genome 
of a plant cell; and 

(d) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
sequence "in the genome of said additional plant cells, 
wherein said synthetic Bacillus thuringiensis gene is 
expressed to produce a pesticidal protein toxin. 

8. A DNA coding sequence produced by: 

(a) analyzing the coding sequence of a gene derived 
from a Bacillus thuringiensis which encodes a pesticidal 
protein toxin; and 

(b) modifying a portion of said coding sequence 

to yield a modified sequence which has a frequency of codon 
usage which more closely resembles the frequency of codon 
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usage of the plant in which it is to be expressed than 
did said coding sequence, said modification comprising 
reducing the number codons having CG in codon positions II 
and III in a region between plant polyadenylation signals 
in said coding sequence. 

13. A method of designing a synthesis Bacillus 
thurinaiensis gene to be more highly expressed in plants, 
comprising the steps of: 

(a) analyzing the coding sequence of a gene derived 
from a Bacillus thurinaiensis which encodes a pesticidal 
protein toxin; 

■(b) modifying a portion of said coding sequence 
to yield a modified sequence which contains a greater 
number of codons preferred by the intended plant host 
than did said coding sequence prior to modification, and 
wherein said modification results in fewer occurrences 
of the sequence AATGAA in said modified sequence than 
in said coding sequence; 

(c) inserting said modified sequence into the genome 
of a plant cell; and 

(d) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
sequence in the genome of said additional plant cells, 
wherein said synthesis Bacillus thurinaiensis gene is 
expressed to produce a pesticidal protein toxin. 

14^ vw ^ DNA coding sequence produced by: 

(a) analyzing the coding sequence of a gene derived 
from a Bacillus thurinaiensis which encodes a pesticidal 
protein toxin; and 

(b) modifying a portion of said coding sequence 
to yield a modified sequence which contains a greater 
number of codons preferred by the intended plant 
host than did said coding sequence, and wherein said 
modification results in fewer occurrences of the 
sequence AATGAA in said modified sequence than in said 
coding sequence. 
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19. A method of designing a synthesis Bacillus 
thurinaiensis gene to be more highly expressed in plants, 
comprising the steps of: 

(a) analyzing the coding sequence of a gene derived 
from a Bacillus thurinaiensis which encodes a pesticidal 
protein toxin; 

(b) modifying a portion of said coding sequence 

to yield a modified sequence which has a frequency of codon 
usage which more closely resembles the frequency of codon 
usage of the plant in which it is to be expressed than 
did said coding sequence prior to modification, and 
wherein said modification results in fewer occurrences 
of the sequence AATGAA in said modified sequence than in 
said coding sequence; 

(c) inserting said modified sequence into the genome 
of a plant cell; and 

(d) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
sequence in the genome of said additional plant cells, 
wherein said synthetic Bacillus thurinaiensis gene is 
expressed to produce a pesticidal protein toxin. 

20. A DNA coding sequence produced by: 

(a) analog the coding sequence of a gene derived 
from a Bacillus thurinaiensis which encodes a pesticidal 
protein toxin; and 

(b) modifying a portion of said coding sequence 

to yield a modified sequence which has a frequency of codon 
usage which more closely resembles the frequency of codon 
usage of the plant in which it is to be expressed than 
did said coding sequence, and wherein said modification 
results in fewer occurrences of the sequence AATGAA in 
said modified sequence than in said coding sequence. 

Claims 1, 6, 7, 12, 13, 18, 19, and 24 of U.S. Patent 

5,567,862 (FX 3) , read: 



-41- 



Interference 103,781 



1. A plant cell comprising a heterologous modified 
structural gene derived from a Bacillus thuringiensis 
gene encoding a pesticidal protein toxin, said plant 
cell produced by the steps of: 

(a) analyzing the coding sequence of a gene derived 
from a Bacillus thuringiensis which encodes a pesticidal 
protein toxin; 

(b) a portion of said coding sequence to yield a 
modified structural gene which contains a greater number 
of codons preferred by said plant cell than did said 
coding sequence prior to modification, said modification 
comprising reducing the number codons having CG in 
codon positions II and III in a region between plant 
polyadenylation signals in said coding sequence; 

(c) inserting said modified sequence into the genome 
of a plant cell; and 

(d) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
structural gene the genome of said additional plant cells, 
wherein said modified structural gene is expressed to 
produce a pesticidal protein toxin. 

6. A method of producing a pesticidal protein 
comprising the steps of: 

(a) introducing into a host plant cell a 
heterologous modified structural gene derived from a 
Bacillus thuringiensis gene wherein the DNA coding 
sequence of the Bacillus thuringiensis gene has been 
modified to contain a greater number of codons preferred 
by said plant cell than did said coding sequence prior 
to modification, said modification comprising reducing 
the number codons having CG' in codon positions II and 
III in a region between plant polyadenylation signals 

in said coding sequence, and 

(b) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
structural gene in the genome of said additional plant 
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cells, wherein said modified structural gene is expressed 
to produce a pesticidal protein toxin 

7. A plant cell comprising a heterologous modified 
structural gene derived from a Bacillus thuringiensis 
gene encoding a pesticidal protein toxin, said plant cell 
produced by the steps of: 

(a) analyzing the coding sequence of a gene derived 
from a Bacillus thuringiensis which encodes a pesticidal 
protein toxin; 

(b) modifying a portion of said coding sequence 
to yield a modified sequence which has a frequency of 
codon usage which more closely resembles the frequency 
of codon usage of genes native to said plant cell than 
did said coding sequence prior to modification, said 
modification comprising reducing the number codons having 
CG in codon positions II and III in a region between plant 
polyadenylation signals in said coding sequence; 

(c) inserting said modified structural gene into 
the genome of a plant cell; and 

(d) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
gene in the genome of said additional plant cells, 
wherein said modified structural gene is expressed 

to produce a pesticidal protein toxin. 

12. A method. of producing a pesticidal protein 
comprising the steps of 

(a) introducing into a host plant cell a 
heterologous modified' structural gene derived from a 
Bacillus thuringiensis gene wherein the DNA coding 
sequence of the Bacillus thuringiensis gene has been 
modified to contain a frequency of codon usage that 
more closely resembles the frequency of codon usage 
of genes native to said plant cell than did said 
coding sequence prior to modification, said 
modification comprising reducing the number codons 
having CG in codon positions II and III in a region 
between plant polyadenylation signals in said coding 
sequence; and 
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(b) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
structural gene in the genome of said additional plant 
cells, wherein said modified structural gene is expressed 
■i ■ to produce a pesticidal protein toxin. 

13. A plant cell comprising a heterologous modified 
structural gene derived from a Bacillus thurinqiensis 
gene encoding a pesticidal protein toxin, said plant cell 
produced by the steps of 

(a) analyzing the coding sequence of a gene derived 
from a Bacillus thurinqiensis which encodes a pesticidal 
protein toxin; 

(b) modifying a portion of said codon sequence to 
yield a modified structural gene which contains a greater 
number of codons preferred by the intended plant host 
than did said coding sequence prior to modification, and 
wherein said modification results in fewer occurrences 

of the sequence AATGAA in said modified structural gene 
than in said coding sequence; 

(c) inserting said modified structural gene into 
the genome of a plant cell; and 

(d) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
structural" gene in the genome of said additional plant 
cells, wherein said modified structural gene is 
expressed to produce a pesticidal protein toxin. 

18. A method of producing a pesticidal protein 
comprising the steps of 

(a) introducing into a host plant cell a 
heterologous modified structural gene derived from a 
Bacillus thurinqiensis gene wherein the DNA coding 
sequence of the Bacillus thurinqiensis gene has been 
modified to contain a greater number of codons preferred 
by said plant cell than did said coding sequence prior 
to modification, and wherein the modification results in 
fewer occurrences of the sequence AATGAA in said modified 
structural gene than in said coding sequence; and 
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(b) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
structural gene in the genome of said additional plant 
cells, wherein said modified structural gene is expressed 
v to produce a pesticidal protein toxin. 

19, A plant cell comprising a heterologous modified 
structural gene derived from a Bacillus thuringiensis 
gene encoding a pesticidal protein toxin, said plant cell 
produced by the steps of 

(a) analyzing the coding sequence of a gene derived 
from a Bacillus thuringiensis which encodes a pesticidal 
protein toxin; 

(b) modifying a portion of said coding sequence 

to yield a modified structural gene which has a frequency 
of codon usage which more closely resembles the frequency 
of codon usage of genes native to said plant cell than 
did said coding sequence prior to modification, and 
wherein said modification results in fewer occurrences 
of the sequence AATGAA in said modified sequence than in 
said coding sequence; 

(c) inserting said modified structural gene into 
the genome of a plant cell; and 

(d) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
structural gene in the genome of said additional plant 

cells ^Herein said modified structural gene is 
expressed to produce a pesticidal protein toxin. 

24. A method of producing a pesticidal protein 
comprising the steps of 

(a) introducing into a host plant cell a 
heterologous modified structural gene derived from a 
Bacillus thuringiensis gene wherein the DNA coding 
sequence of the Bacillus thuringiensis gene has been 
modified to contain a frequency of codon usage that 
more closely resembles the frequency of codon usage 
of genes native to said plant cell than did the coding 
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sequence prior to modification, and wherein the 
modification results in fewer occurrences of the 
sequence AATGAA in said modified structural gene 
than in said coding sequence; and 

(b) maintaining said plant cell under conditions 
suitable to allow replication of said plant cell to 
produce additional plant cells having said modified 
structural gene in the genome of said additional plant 
cells, wherein said modified structural gene is expressed 
to produce a pesticidal protein toxin. 

Claims 1, 4, and 11 of Mycogen's U.S. Patent 5,380,831 are 
directed to methods of designing a synthetic B. t . gene to be more 
highly expressed in a plant than a native B. t . gene, comprising: 

(a) analyzing. the coding sequence of a native B . t . gene 
encoding an insecticide, and 

(b) modifying a portion of said analyzed coding sequence 

(i) to contain a greater number of codons preferred 
by said plant than did the analyzed coding 
sequence, and/or 

(ii) to yield a modified sequence which has a frequency 
._^ w of codon usage which more closely resembles the 

frequency of codon usage of said plant. 
By reference to substantially the same language appearing in 
claims of Mycogen' s U.S. Patent 5,467,600, the Federal Circuit 
indicated that the subject matter defined by the language of 
step (b) (i) above, which also appears in the .process defined by 
Claim 1 of Mycogen' s U.S". Patent 5, 380, 831, is either synonymous 
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with, of subsumed by, the language of step (b) (ii) above, which 

appears in the process defined by Claim 11 of Mycogen' s U.S. 

Patent 5,380,831. In Mvcoaen Plant Sci., Inc. v. Monsanto Co. , 

243 F.3d 1316, 1327, 58 USPQ2d 1030,' 1038 (Fed. Cir. 2001), the 

Federal Circuit quoted the following holding of the Delaware 

district court in reference to Claims 1, 2, 13, and 14 of 

Mycogen's U.S. Patent 5,567,600: 

[T]he phrase "greater number of codons preferred," 
is satisfied where the newly-created synthetic 
gene has a higher number of those codons whose 
frequency in the native Bt gene was lower than 
their frequency in the intended plant host, 
and where the synthetic gene has an overall 
distribution of codon usage that is closer to 
that of the intended plant host. 

Mvcoaen Plant Science, Inc. v. Monsanto Co. , 61 F. Supp.2d 199, 

215 (D. Del. 1999) . 

On appeal, Mycogen contested the Delaware district court's 

claim interpretation. See Mycogen Plant Sci., Inc. v. Monsanto 

Co. , 243 F.3d at 1326, 58 USPQ2d at 1038: 

Myco'gen contests a portion of the district court's 
claim construction. Specifically, Mycogen contends that 
the district court's definition of the "greater number 
of codons preferred" language in independent claims 1, 
2, 13 and 14 of the '600 patent is erroneous. 
However, the claim construction issue here relates 
to both the '600 and the '862 patent, as well as 
the original '831 parent patent, as all three patents 
contain claims that use the language disputed herein. 

The Federal Circuit stated, 243 F.3d at 1327,- 58 USPQ2d at 1039: 
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[T]he district court's claim construction defines a 
"preferred codon" to be any codon that brings the 
modified Bt gene's codon frequency closer to that 
of the intended plant host. 

*' Mycogen responded, Mycoaen Plant Sci., Inc. v. Monsanto Co. , 

243 F.3d at 1328-1329, 58 USPQ2d at 1040: 

[T]he district court's definition of "greater number 
of codons preferred" cannot stand because it renders 
this phrase synonymous with the "frequency of codon 
usage" limitation in independent claims 7, 8, 19 
and 20 of the '600 patent, thus rendering these two 
sets of claims identical. 

The Federal Circuit held that "the district court's claim 

construction regarding the 'greater number of codons preferred' 

limitation was correct." Mycoaen Plant Sci,, Inc. v. Monsanto 

Co. , 243 F.3d at 1330, 58 USPQ2d at 1041. 

The two-step methods claimed in Mycogen' s U.S. 5,380,831 are 

broader in scope than the four-step methods claimed in each of 

Mycogen's U.S. Patent 5,567,600, U.S. Patent 5,567,862, U.S. 

Patent 6,013,523, and U.S. Patent 6,015,891, because the claimed 

two-step methods., "of designing a synthetic Bacillus thurinqiensis 

gene to be more highly expressed in plants" are open to the 

additional steps of "inserting said modified sequence into the 

genome of a plant cell; and . . . maintaining said plant cell 

under conditions suitable to allow replication of said plant cell 

to produce additional plant cells having said modified sequence 

in the genome of said additional plant cells, wherein said 
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synthetic Bacillus thurinaiensis gene is expressed to produce a 
pesticidal protein toxin"; e.g., Claims 1 and 7 of Mycogen's U.S. 
Patent 5,567,600. Thus, the Delaware district court's conclusion 
that the four-step methods claimed in Mycogen's U.S. Patents 
5,567,600 and 5,567,862, which include the two-step methods 
claimed in Mycogen's U.S. Patent 5,380,831, are unpatentable 
under 35 U.S.C. § 102 (g) /103 and/or 35 U.S.C. § 112, first 
paragraph (enablement requirement), does not require a conclusion 
that the two-step methods claimed in Mycogen's U.S. Patent 
5,380,831 are unpatentable for all the same reasons. The 
enablement problems may stem from the added method steps. 
Showings required to establish priority of invention for four- 
step processes generally exceed showings required to establish 
priority of invention for at least two steps of the four-step 
processes. Furthermore, the Delaware district court's conclusion 
that claims of Mycogen's U.S. Patent 5,567,600, which are drawn 
to a DNA coding..,sequence produced by the method of Claims 1 and 
11 of Mycogen's U.S. Patent 5,380,831, are unpatentable under 
35 U.S.C. § 102(g)/103 and/or 35 U.S.C. § 112, first paragraph, 
does not require a conclusion that method Claims 1 and 11 of 
Mycogen's U.S. Patent 5,380,831 also are unpatentable under 
35 U.S.C. § 102(g)/103 and/or 35 U.S.C. § 112, first paragraph, 
for the same reasons. 
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However, the conclusion that the DNA coding sequences of 
Claims 2 and 8 of Mycogen's U.S. Patent 5,567,600, which are 
produced by the methods of Claims 1 and 11 of Mycogen's U.S. 
Patent 5,380,831, are directed to the "same . . . invention" 
as the methods of Claims 1 and 11 of Mycogen's U.S. Patent 
5,380,831, and accordingly, the "same . . . invention" as Count 1 
of this interference, is inescapable because the processes 
defined by Claims 1 and 11 of Mycogen's U.S. Patent 5,380,831 
and the processes which define the products-by-process of 
Claims 2 and 8 of Mycogen's U.S. Patent 5,567,600 are identical. 
Moreover, the four-step processes of designing a synthetic B . t . 
gene to be more highly expressed in plants of Claims 1 and 7 of 
Mycogen's U.S. Patent 5,567,600 would have been obvious under 
35 U.S.C. § 103 to persons having ordinary skill in the art in 
view of the two-step processes of Claims 1 and 11 of Mycogen's 
U.S. Patent 5,380,831 for designing a synthetic B. t . gene to be 
more highly^gxp-r.essed in plants, because the two additional 
steps of the four-step processes of the. .'600 patent no more than 
generally recite the utility described for the two-step processes 
of the x 831 patent. See 37 CFR § 1.642 and 37 CFR § 1.601(n). 
The product-by-process claims of Mycogen's U.S. Patent 5,567,862 
are directed to the "same . . . invention" as the process claims 
of Mycogen's U.S. Patent 5,567,600 again because the processes 
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defined by the product-by-process claims and the process claims 
of the respective patents are identical. 

^ Nevertheless, even though the Delaware district court held 
that the claims of Mycogen's U.S. Patents 5,567,600 and 5,567,862 
are invalid, the claims of Mycogen's U.S. Patents 6,013,523 and 
6,015,891 are presumed valid. Therefore, questions of greater 
import to the present interference are whether the claims of the 
later issued patents, which are presumed valid, are directed to 
the same patentable invention as Claims 1 and 11 of Mycogen's 
U.S. Patent 5,380,831 and Count 1, and accordingly, whether the 
subject matter of the count should be redefined by (1) adding 
Mycogen's U.S. Patent 6,015,891 to this interference and 
designating Claims 1-6 thereof as corresponding to the count, and 
(2) adding Mycogen's U.S. Patent 6,013,523 to this interference 
and designating Claims 1-4 thereof as corresponding to the count. 

First, the terminal portion of Mycogen's U.S. Patent 
6,015,891 Hbich-.extends beyond the expiration dates of Mycogen's 
U.S. Patents 5,567,600 and 5,380,831 has been disclaimed by 
Mycogen, and the terminal portion of Mycogen's U.S. Patent 
6,013,523 which extends beyond the expiration date of Mycogen's 
U.S. Patents 5,567,862 has been disclaimed by Mycogen. Terminal 
disclaimers are necessitated by an examiner's rejection of claims 
of a later-issued, commonly assigned patent for obviousness-type 
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double patenting of the subject matter claimed in an earlier- 
issued, unexpired patent. 

Second, Mycogen Plant Sci., Inc. v. Monsanto Co. , 243 F.3d 
at 1325, 58 USPQ2d at 1037, instructs: 

The district court analyzed the infringement 
decision and stated "[f]or every one of [Monsanto' s] 
accused genes and gene products, Mycogen presented 
evidence, detailed below, showing that a particular 
codon usage table was used to design each accused 
gene in a way that infringes the '600 and '862 patents. 
Defendants did not challenge this evidence. Mycogen, 
61 F. Supp. 2d at 245. 

Mycogen has established that the common analysis and modification 

steps of the processes claimed in Mycogen' s U.S. Patent 5,567,600 

and U.S. Patent 5,567,862 involve the steps of (I) "obtaining 

a table indicating codon usage bias for a gene or genes more 

highly expressed in a plant cell than a native B. t . gene," and 

(II) "using said table to design a modified coding sequence which 

encodes said protein toxin, whereby said modified coding sequence 

has a frequency of codon usage that more closely resembles the 

frequency of^co'don usage of the plant cell in which it is to be 

expressed than did the native B. t . coding sequence encoding said 

protein toxin, said modified coding sequence having at least 

about 10% of the nucleotides changed as compared to the native 

B. t . coding sequence" (Mycogen' s U.S. 6,015,891, Claims 1 and 4; 

Mycogen's U.S. 6,013,523, Claim 1). Method Claims 1 and 11 of 

Mycogen' s U.S. 5, 380, 831 comprise analysis and modification steps 
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substantially identical to those recited in the method claims 

of Mycogen's U.S. Patents 5,567,600 and 5,567,862. Moreover, 

the specification of Mycogen' s U.S. 5,380,831 teaches (col. 7, 
v - 
1. 1-28) (emphasis added) : 

Frequency of. preferred codon usage refers to the 
preference exhibited by a specific host cell in usage 
of nucleotide codons to specify a given amino acid. 
To determine the frequency of usage of a particular 
codon in a gene, the number of occurrences of that 
codon in the gene is divided by the total number of 
occurrences of all codons specifying the same amino 
acid in the gene. Table 1, for example, gives the 
frequency of codon usage for Bt genes, which was 
obtained by analysis of four Bt genes whose sequences 
are publicly available. Similarly, the frequency of 
preferred codon usage exhibited by a host cell can be 
calculated by averaging frequency of preferred codon 
usage in a large number of genes expressed by the host 
cell. It is preferable that this analysis be limited 
to genes that are highly expressed by the host cell. 
Table 1 (see page 43) , for example, gives the frequency 
of codon usage by highly expressed genes exhibited by 
dicotyledonous plants, and monocotyledonous plants. 
The dicot codon usage was calculated using 154 highly 
expressed coding sequences obtained from Genbank 
which are listed in Table 1. Monocot codon usage 
was calculated using 53 monocot nuclear gene coding 
sequences obtained from Genbank and listed in Table 1, 
located in Example 1 . 

When synthesizing a gene for improved expression 

in a host cell it is desirable to design the gene 

such that its frequency of codon usage approaches the 

frequency of preferred codon usage of the host cell . 

Mycogen' s U.S. Patent 5,380,831 expressly states (Col. 10, 

1. 62, to Col. 11, 1. 3) (emphasis added): 

For example, in preferred embodiments, the synthetic 
insecticidal protein is strongly expressed in dicot 
plants, e.g., tobacco, tomato, cotton, etc., and hence, 
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a synthetic gene under these conditions is designed to 
incorporate to advantage codons used preferentially by 
highly expressed dicot proteins. In embodiments where 
enhanced expression of insecticidal protein is desired 
in a monocot, codons preferred by highly expressed 
•i monocot proteins (given in Table 1) are employed in 
designing the synthetic gene. 

See also the disclosure of Mycogen's U.S. Patent 5,380,831 at 

column 11, lines 27-59, with regard to preferable avoidance of 

the CG dinucleotide in codon positions II and III, including 

references to prior art disclosing the "two codon choice indices 

for quantifying CG and AT doublet avoidance in codon positions II 

and III'' (col. 11, 1. 38-41) calculated for the plant data 

presented and tabulated in Table 2 (col. 11, 1. 44-59). Claim 3 

of Mycogen's U.S. Patent 5,380,831 explicitly recites "the step 

of modifying a portion of said coding sequence to yield CG and AT 

doublet avoidance indices which more closely resemble those of 

the intended plant host" (Col. 38, 1. 38-41, Claim 3). 

Third, Mycogen's U.S. Patent 5,380,831 includes the 

following statements (Col. 14, 1. 12-66; citations omitted): 

The recombinant DNA molecule carrying a synthetic 
structural gene under promoter control can be introduced 
into plant tissue by any means known to those skilled in 
the art. The technique used for a given plant species 
or specific type of plant tissue depends on the known 
successful techniques. . . . Once introduced into the 
plant tissue, the expression of the structural gene may 
be assayed by any means known to the art, and expression 
may be measured as mRNA transcribed or as protein 
synthesized. Techniques are known for the in vitro 
culture of plant tissue, and in a number of cases, 
for regeneration of whole plants. Procedures for 
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transferring the introduced expression complex to 
commercially useful cultivars are known to those 
skilled in the art. 

... Once plant cells expressing a synthetic 
insecticidal structural gene under control of a plant 
expressible promoter are obtained, plant tissues and 
whole plants are then reproduced by conventional 
means and the introduced genes can be transferred 
to other strains and cultivars by conventional plant 
breeding techniques . 

Given that Mycogen has terminally disclaimed the unexpired 
term of Mycogen' s U.S. Patent 6,015,891 which extends beyond 
the expiration dates of Mycogen' s U.S. Patents 5, 567, 600 and 
5,380,831 and the unexpired term of Mycogen' s U.S. Patent 
6,013,523 which extends beyond the expiration date of Mycogen' s 
U.S. Patent 5,567,862 and, presuming that the claims of Mycogen' s 
U.S. Patent 5,567,600, U.S. Patent 5,567,862, and U.S. Patent 
5,380,831 are directed to the same invention as Count 1 of this 
interference, it seems reasonable to conclude that the subject 
matter claimed of Mycogen' s U.S. Patents 6,015,891 and 6,013,523 
would have .bgen.,-„obvious to persons having ordinary skill in the 
art in view of prior art including Claims 1 and 11 of Mycogen' s 
U.S. Patent 5,380,831. Accordingly, arguments can be made that 
the claims of Mycogen' s U.S. Patents 6,015,891 and 6,013,523 are 
directed to the "same patentable invention" as the claims in 
Mycogen's U.S. Patent 5,380,831 and Count 1 of this interference. 
Therefore, the parties should consider whether Mycogen's U.S. 
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Patents 6,015,891 and 6,013,523 should be added to this 
interference . 

E. Fischhoff's Preliminary Motion 4 (Paper No. 81) 
By Fischhoff's Preliminary Motion 4 (Paper No. 81), 
Fischhoff moves under 37 CFR § 1.633(c) (1) to redefine the 
interference count by substituting Proposed Count 2, 3, or 4 for 
existing Count 1 (Paper No. 81, p. 1) . The alternatively 
proposed counts read as follows (Paper No. 81, pp. 14-16): 

Fischhoff's Proposed Count 2 
Claim 1 of the '831 patent; 3 
- or - 

Claim 3 of the '105 application. 4 



1. ['831 patent] A method of designing a synthetic 
Bacillus thuringiensis gene to be more highly expressed 
in plants, comprising the steps of: 

analyzing the coding sequence of a gene derived 
from Bacillus thuringiensis which encodes an insecticidal 
protein toxin, and 

modifying a portion of said coding . sequence to yield 
a modified sequence which contains a greater number of 
codons preferred by the intended plant host than did said 
coding sequence. 

4 3. ['105 application] A method for modifying a wild- 
type structural gene sequence which encodes an insecticidal 
protein of Bacillus thuringiensis to enhance the expression 
of said protein in plants which comprises: 

a) removing polyadenylation signals contains [sic] 
in said wild-type gene while retaining a sequence which 
encodes said protein; and 
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Fischhoff s Proposed Count 3 

All claims of the party Adang et al. that survive the 
37 CFR 1.633(a) and 37 CFR (c)(4) motions filed by the 
party Fischhoff et al.; 

- or - 

All claims of the party Fischhoff et al. (except 

claim 43) that survive any 37 CFR 1.633(a) and 

37 CFR (c) (4) motions filed by the party Adang et al. 

Fischhoff 's Proposed Count 4 

A. A method of .preparing a gene derived from a 

Bacillus thurinoiensis , said method comprising the 
steps of: 

(a) selecting a group of genes that encode expressed 
proteins in a host plant; 

(b) calculating the frequency of usage for codons 
within the structural coding region in said group 
of genes; 

(c) selecting a gene derived from Bacillus 
thurincriensis , which encodes an insecticidal 
protein having- an amino acid sequence; 

(d) preparing a gene encoding the same amino acid 
sequence as the gene selected in step © but which 
has a structural coding region with: 

(i) a frequency of usage for codons that 
closely approaches the frequency 
of usage for codons determined in 
step (b) ; 

(e) inserting the gene obtained in step (d) into said 
host plant's genome; and 



b) removing ATTTA sequences contained in said 
wild-type gene while retaining a sequence which encodes 
said protein. 
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(f) obtaining a transgenic plant with enhanced 

expression of a protein derived from a Bacillus 
thurinaiensis ; 

- or - 

B. A method of preparing a gene derived from a 

Bacillus thurinaiensis , said method comprising the 
steps of: 

(a) selecting a gene derived from a Bacillus 
thurinqiensis , which encodes an insecticidal 
protein having an amino acid sequence: 

(b) preparing a gene encoding the same. amino acid 
sequence as the gene selected in step (a) but 
which has a structural coding region with: 

(i) fewer ATTTA sequences,, 

(ii) fewer plant polyadenylation signals, and 

(iii) lower A+T content 

(c) inserting the gene obtained in step (b) into a 
host plant's genome; and 

(d) obtaining a transgenic plant with enhanced 
expression of a protein derived from a Bacillus 
thurinqiensis . 

The phrase "greater number of codons preferred" of Claim 1 

of Mycogen's U.S. 5,380,831 has been interpreted as follows by 

the Delaware district court in Mycoqen Plant Sci., Inc. v. 

Monsanto Co. , 61 F. Supp.2d at 215: 

[T]he phrase "greater number of codons preferred," 
is satisfied where the newly-created synthetic 
gene has a higher number of those codons whose 
frequency in the native Bt gene was lower than 
their frequency in the intended plant host, 
and where the synthetic gene has an overall 
distribution of codon usage that is closer to 
that of the intended plant host. 
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Given that interpretation, Fischhoff argues that existing Count 2 

ought to be replaced by Fischhoff s Proposed Count 2, 3, or 4 

because "Count 1 is narrower than the corresponding definition of 

gene modifications in Claim 3 of the '105 application" (Paper 

No. 81, p. 4, 1. 1-3). In effect, Fischhoff argues that 

Fischhoff' s Proposed Count 2, 3, or 4 should be substituted for 

existing Count 1 because the invention of Claim 3 of Fischhoff s 

involved U.S. Application 08/434,105 is not encompassed by, and 

does not correspond to, the interfering subject matter defined by 

existing Count 1. Whether presented in a motion under 37 CFR 

§ 1.633(c) (4) to redefine the interfering subject matter by 

designating Claim 3 of the '105 application as not corresponding 

to the count or in a motion under 37 CFR § 1.633(c) (1) to 

redefine the interfering subject matter by substituting a count, 

the issue is the same. Presuming that Claim 3 of Fischhoff s 

U.S. Application 08/434,105 stands properly designated as 

corresponding to Count 1, the question to be asked is whether 

Fischhoff has provided sufficient reasons why the relief it 

requests should be granted? 

Claim 3 of Fischhoff s involved U.S. Application 08/434,105 

reads (Paper No. 81, p. 4): 

3. ('105 Appl.) A method for modifying a wild-type 
structural gene sequence which encodes an insecticidal 
protein of Bacillus thuringiensis to enhance the 
expression of said protein in plants which comprises: 
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a) removing polyadenylation signals contained in said 
wild-type gene while retaining a sequence which 
encodes said protein; and 

. . b) removing ATTTA. sequences contained in said wild-type 
gene while retaining a sequence which encodes said 
protein. 

On its face, neither the step of part a) nor the step of part b) 
of Claim 3 of the Fischhof f ' s '105 application appears to create 
a synthetic gene having a "higher number of those codons whose 
frequency in the native Bt gene was lower than their frequency in 
the intended plant host," the interpretation given the language 
"greater number of codons preferred" of Count 1 of this 
interference and Claim 1 of Mycogen' s U.S. Patent 5,380,831 
by the Delaware district court. Mycogen Plant Sci., Inc. v. 
Monsanto Co. , 243 F.3d 1316, 58 USPQ2d 1030 (Fed. Cir. 2001), 
affirming Mycogen Plant Sci.,, Inc. v. Monsanto Co. , 
61 F. Supp.2d at 215. However, Count 1 of this interference 
alternatively comprises the step of "modify a portion of said 
coding sequ§»-ce-.-.to yield a modified sequence which contains a 
greater number of codons preferred by the intended plant host 
than did said coding sequence and fewer plant polyadenylation 
signals than said coding sequence" (Paper No. 2, Count lb,; 
emphasis added) . 

The criteria for determining whether a claim should be 
designated as not corresponding to the count is presented in 
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37 CFR § 1. 637 (c) (4) : 

A preliminary motion seeking to designate an 
application or patent claim as not corresponding to 
the count shall: 

(i) Identify the claim and the count. 

(ii) Show that the claim does not defined [sic] the 
same patentable invention as any other claim 
whose designation in the notice declaring the 
interference as corresponding to the count the 
party does not dispute. 

Fischhoff does not dispute that Claim 5 of Fischhoff's U.S. 

Application 08/434,105, as amended November 29, 1995, stands 

properly designated as corresponding to Count 1. Claim 5 of the 

'105 application, as amended November 29, 1995, reads: 

5. The method of Claim 3 further comprising the use 
of plant preferred sequences in the removal of the 
polyadenylation signals and ATTTA sequences. 

Claim 5 of the '105 application, as originally filed, differed 

from Claim 5, as amended November 29, 1995, only in its 

dependency on original Claim 4. Claim 4 of the '105 application, 

as originally,,-f-iled, defines the scope of the subject matter 

encompassed by Claim 5 designated as corresponding to Count 1. 

Original Claim 4 reads: 

4. A method of Claim 3 further comprising the removal 
of self-complementary sequences and replacement of such 
sequences with nonself-complementary DNA comprising 
preferred codons while retaining a structural gene 
sequence encoding said protein. 
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The designation of Claim 5 of the '105 application as 
corresponding to Count 1 is not disputed by Fischhoff. Thus, 
the only question to be decided is whether Claim 3 of the '105 
application defines the same patentable invention as Claim 5 of 
the '105 application. 

Claim 3 of the '105 application is directed to the same 
patentable invention as Claim 5 of the '105 application if the 
subject matter defined by Claim 3 would have been obvious under 
35 U.S.C. § 103 to a person having ordinary skill in the art 
in view of prior art cr&sumed to include the subject matter of 
Claim 5 of the '105 application. 37 CFR § 1.601(n). If it is 
properly dependent on Claim 3 of the '105 application, method 
Claim 5 of the '105 application must be construed to incorporate 
by reference all the process limitations of Claim 3 of the '105 
application to which it refers and set forth a further limitation 
thereof. 35 U.S.C. § 112, fourth paragraph. Neither Fischhoff, 
Adang, nor. -the^-examiner responsible for. initiating this 
interference rejected the patentability of Claim 5 of the- '105 
application under 35 U.S.C. § 112, fourth paragraph. Therefore, 
the conclusion is inescapable that the subject matter defined by 
independent Claim 3 of the '105 application would have been prima 
facie obvious to a person having ordinary skill in the art in 
view of the subject matter defined by dependent Claim 5 of the 
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*105 application which incorporates all the limitations of 
independent Claim 3 of the '105 application upon which it 
cjepends. Accordingly, Fischhoff's justification for redefining 
the subject matter of the interference by substituting substitute 
Count 2, 3, or 4 for Count 1 is undermined. Thus, Fischhoff has 
not met its burden to show it is entitled to the relief sought. 

Mycogen also suggests that Monsanto' s motion to redefine the 
interfering subject matter by substituting proposed Count 2, 3, 
or 4 for existing Count 1 is designed primarily to expand the 
scope of the subject matter defined by the interference count to 
include methods of designing a synthetic Bacillus thuringiensis 
gene to be more highly expressed in plants comprising the steps 
of analyzing the coding sequence of a gene derived from B , t . 
which encodes an insecticidal protein toxin, and modifying a 
portion of said coding sequence to yield a modified sequence 
which contains either (1) a greater number of codons preferred by 
the intende.d-plant than did said coding sequence, (2) fewer 
plant polyadenylation signals than did said coding sequence, or 
(3) fewer ATTTA sequences than did said coding sequence. Whether 
or not Monsanto' s aim is to expand the scope of the interfering 
subject matter, and concomitantly, the scope of the evidence upon 
which Mycogen and Monsanto may rely to show priority with respect 
to the same patentable invention claimed by the parties to this 
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interference, Monsanto' s motion may be warranted if the broader 
scope of the interfering subject matter defined by the substitute 
count better defines the scope of the patentable invention 
contested by the parties to this interference. 

Does any one of Fischhoff's Proposed Counts 2, 3, and 4 
better define the scope of the interfering subject matter for 
which priority of invention is contested by the parties to this 
interference than existing Count 1? If so, then Fischhoff's 
motion to redefine the subject matter claimed by substituting 
that proposed count for existing Count 1 should be granted. 

Proposed Count 4 alternatively defines methods of preparing 
a gene derived from B . t . which includes extraneous steps upon 
which the parties do not rely for patentability and language 
and/or terms whose meanings remain subject to interpretation; 
e.g., "calculating the frequency of usage for codons within the 
structural coding region in said group of genes" (Proposed 
Count 4A(b)J^.and. "lower A+T content" (Proposed Count 4B(b)(iii)). 
Proposed Count 3 creates more uncertainty than certainty as to 
the scope of the interfering subject matter. The full scope of 
the subject matter defined by Proposed Count 3 must be finally 
resolved before the parties can determine what evidence is 
relevant to their respective cases for priority of invention. 
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On the other hand, Proposed Count 2, which is directed 

alternatively to Claim 1 of Mycogen' s U.S. Patent 5,380,831 and 

Claim 3 of Monsanto's U.S. Application 08/434,104, is not only 

well defined, but it broadly encompasses claimed subject matter 

which the parties consider best representative of their 

inventions. Although Claim 1 of Mycogen' s U.S. Patent 5,380,831 

and Claim 3 of Monsanto' s U.S. Application 08/434,104 do not 

themselves appear to be directed to the same patentable 

invention, Claim 4 of Mycogen's U.S. Patent 5,380,831 and Claim 5 

of Monsanto' s U.S. Application 08/434,104, which are designated 

as corresponding to Count 1 (in-part a transcription of Claim 1 

of Mycogen' s U.S. Patent 5,380,831), clearly define the same 

patentable invention. See 37 CFR § 1.601(j): 

An inter f erence-in-f act exists when at least one 
claim of a party that is designated to correspond 
to the count and at least one claim of an opponent 
which is designated to correspond to the count 
define the same patentable invention. 

Neverthelesj^>^.iven the Delaware district court's interpretation 

of language in claims of Mycogen' s U.S. Patent 5,567,600 common 

to language in Claim 1 of Mycogen' s U.S. Patent 5,380,831 in 

Mycogen Plant Sci., Inc. v. Monsanto Co. , 61 F. Supp.2d 199, 215 

(D. Del. 1999), Fischhoff's Proposed Count 2 is not broad enough 

to encompass all claims that are patentable over the prior art 

and correspond to the count. See 37 CFR §§ 1.601(f) and 606. 
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Claims 11 and 12 of Mycogen's U.S. 5,380,831 are presumed to be 
patentable over the prior art and, on their face, appear to be 
broader in scope than the subject matter defined by either 
Claim 1 of Mycogen's U.S. Patent 5,380,831 or Claim 3 of 
Monsanto' s U.S. Application 08/434, 104. Accordingly, to the 
extent Fischhoff moves under 37 CFR § 1.633(c) (1) to redefine the 
interfering subject matter by substituting any one of proposed 
Counts 2, 3, and 4 for existing Count 1 (Paper No. 81), the 
motion is DENIED . 

However, Fischhoff alternatively invites the APJ to 
substitute a count of its own construction which is not narrower 
in scope than any application claim that is patentable over the 
prior art which is designated as corresponding to the count or 
any patent claim designated to correspond to the count. The 
invitation is accepted. 

It is ORDERED that Interference 103,781 is redeclared as: 



JUNIOR PARTY APPLICATION (Fischhoff) 



Named Inventors: 



Kenneth A. Barton and Michael J. Miller 



Application : 



Application 07/827,906, filed 
January 30, 1992 



Title: 



Improved Expression of Genes in Plants 



Assignee : 



None (assignment to Monsanto Company 
recorded October 15, 1996; assignment 
to Monsanto Technology LLC recorded 
June 13, 2001) 
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Accorded benefit 
for the purpose of 
priority of: 



Application 07/390,561, filed August 7, 
1989 



- or -. 



Named Inventors: 



David A. Fischhoff and Frederick J. 
Perlak 



Application : 



Application 08/434,105, filed May 3, 
1995 



Title: 



Assignee : 



Accorded benefit 
for the purpose of 
priority of: 



Synthetic Plant Genes and Method for 
Preparation 

None (assignment to Monsanto Technology 
LLC recorded June 13, 2001) 



Application 07/959,506, filed October 9, 
1992, now U.S. Patent 5,500,365, issued 
March 3, 1996; Application 07/476,661, 
filed February 12, 1990, now abandoned; 
and Application 07/315,355, filed 
February 24, 1989, now abandoned 

SENIOR PARTY PATENT (Adana) 



Named Inventors; 



Application: 



Title: 



Michael J. Adang, Thomas A. Rocheleau, 
Donald J. Merlo and Elizabeth E. Murray 

Application 08/057,191, filed May 3, 
1993, now U.S. Patent 5,380,831, issued 
January 10, 1995 

Synthetic Insecticidal Crystal Protein 
Gene 



Assignee : 



Mycogen Plant Science, Inc. (Paper 
No. 13) 
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Accorded benefit 
for the purpose of 

priority of: Applications 07/827,844, filed 

January 28, 1992, now abandoned, 
v and Application 07/242,482, filed 

September 9, 1988, now abandoned 

Count 2 

Any one of Claims 1-4, 7, and 15-22 of Barton et al.'s 
Application 07/827,906, filed January 30, 1992; 

- or - 

Any one of Claims 3, 5, and 39-43 of Fischhoff et al.'s 
Application 07/827,906, filed January 30, 1992; 

- or - 

Any one of Claims 1-14 of Adang et al.'s 

U.S. Patent 5,380,831, which issued January 10, 1995, 
from U.S. Application 08/057,191, filed May 3, 1993. 

The claims of the parties which correspond to this 
count are: 

Barton et al.: Claims 1-4, 7, and 15-22 

Fischhoff et al.: Claims 3, 5, and 39-43 
Adang et al.:. Claims 1-14. 

F. Fischhoff s Preliminary Motion 8 (Paper No. 86) 
By Fischhoff s uncontested Preliminary Motion 8 (Paper 
No. 86), Fischhoff moves under 37 CFR § 1.633(f) to be accorded 
benefit of the October 9, 1992, filing date of Fischhoff s U.S. 
Application 07/959,506; the February 12, 1990, filing date of 
Fischhoff s U.S. Application 07/476,661; and the February 24, 
1989, filing date of U.S. Application 07/315,355, for Fischhoff s 
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Proposed Counts 2, 3, and 4 (Fischhoff's Preliminary Motion 4, 
Paper No. 81) (Paper No. 86, p. 1). In that Fischhoff's 
Preliminary Motion 4 (Paper No. 81) has been denied for Proposed 
Counts 2, . 3, and 4, this motion also is DISMISSED . 

However, the declaration was redeclared in Paragraph E above 
with new Count 2 defining the interfering subject matter. For 
Count 2 of the redeclared Interference 103,781, Fischhoff has 
been accorded benefit for purposes of priority of the. October 9, 
1992, filing date of U.S. Application 07/959,506, now U.S. Patent 
5,500,365, issued March 3, 1996; the February 12, 1990, filing 
date of U.S. Application 07/476,661, now abandoned; and the 
February 24, 1989, filing date of U.S. Application 07/315,355, 
now abandoned. 

G. Fischhoff's Preliminary Motion 11 (Paper No. 89) 
By Fischhoff's Preliminary 11 (Paper No. 89), Fischhoff 
moves under 37 CFR § 1.633(c)(2) to redefine the interfering 
subject matter-by adding proposed Claim 4 6 to Fischhoff's U.S. 
Application 08/434,105, filed May 3, 1995 (Paper No. 90) and 
designating the new claim as corresponding to the count. The 
Motion is contingent upon granting Fischhoff's Second 37 CFR 
§ 642 Request (Paper No. 79). Since Fischhoff's Second 37 CFR 
§ 642 Request (Paper No. 79) has been dismissed, this motion also 
is DISMISSED. 
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H. Fischhoff's Preliminary Motion 12 (Paper No. 60) 
By Fischhoff's Preliminary Motion 12 (Paper No. 60), 

Fischhoff moves under 37 CFR § 1.633(f) for benefit of the 
October 9, 1992, filing date of Fischhoff's U.S. Application 
07/959,506; the February 12, 1990, filing date of Fischhoff's 
U.S. Application 07/476,661; and the February 25, 1989, filing 
date of Fischhoff's U.S. Application 07/315,355; for the 
subject matter defined by Adang's Proposed Substitute Count 2. 
Fischhoff's motion is contingent upon granting Adang's 
Preliminary Motion 1 (Paper No. 45). Adang's Preliminary 
Motion 1 (Paper No. 45) has been denied. Therefore, this motion 
is DISMISSED . 

I. Adang's 37 CFR § 635 Motion For 
Order Implementing the CAFC Decision 
In Barton v. Adang (Paper No. 116) 

In Part 3, Paragraph E, of this decision, it was ordered 

that this interference was redeclared, in effect implementing 

Barton v. Adang.. 162 F.3d 1140, 49 USPQ2d 1128 (Fed. Cir. 1998) 

(Paper No. 118, Exh. A). Adang's unopposed Rule 635 motion is 

GRANTED . 

J. Fischhoff's Preliminary Motion 6 (Paper No. 83) 
By Fischhoff's Preliminary Motion 6 (Paper No. 83), 
Fischhoff moves under 37 CFR § 1.633(c) (2) to redefine the 
interfering subject matter by amending Fischhoff's U.S. 
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Application 08/434,105 to add new Claims 44 and 45 (Paper No. 84) 

and designating new Claims 44 and 45 as corresponding to the 

count, now new Count 2 (see the Order redeclaring Interference 

103,781 in Part 3, paragraph E, of this decision). The motion is 

unopposed by Adang. Nevertheless, Fischhoff has the burden of 

proof to show that it is entitled to the relief sought in its 

motion. 37 CFR § 1.637(a). To show that it is entitled to the 

relief -sought in a motion under 37 CFR § 1.633(c) (2), the movant 

must (i) "[plropose an amended or added claim[;]" (ii) u [s]how 

that the claim proposed to be amended or added defines the same 

patentable invention as the count [;] " and (iii) "[s]how the 

patentability to the applicant of each claim proposed to be 

amended or added to the disclosure of the application . . . . " 

Part (b) of Fischhoff s proposed method Claim 44 and DNA coding 

sequence Claim 45 reads (Paper No. 83, pp. 2-3; Paper No. 84): 

(b) preparing a gene encoding the same amino acid 
sequence as the gene selected in step (a) but which has 
a structural coding region with: 

(i) fewer ATTTA sequences, 

(ii) fewer plant polyadenylation signals, and 

(iii) lower A+T content [. ] 

Fischhoff merely presumes that the method of proposed Claim 44 
and the DNA coding sequence of proposed Claim 45, both of which 
recite the additional step (b) (iii) of preparing a B . t . gene 
having a structural coding region with lower A+T content, would 
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have been obvious to persons having ordinary skill in the art in 
view of prior art including the method of modifying a wild-type 
structural gene which encodes an insecticidal protein .of B . t . 
defined by Claim 3 of Fischhoff's U.S. Application 08/434,105. 
Claim 3 of Fischhoff's U.S. Application 08/434,105 recites the 
following steps: 

a) removing polyadenylation signals contained in said 
wild-type gene while retaining a sequence which 
encodes said protein; and 

b) removing ATTTA sequences contained in said wild-type 
gene while retaining a sequence which encodes said 
protein . . . [ ; ] 

Fischhoff has not satisfied its burden to show that the 

additional step of preparing a B. t . gene having a structural 

coding region with lower A+T content would have been obvious 

to persons having ordinary skill in the art in view of prior 

art teachings including Claim 3 of Fischhoff's U.S. application 

08/434,105. Nor has Fischhoff shown that proposed Claims 44 and 

45 including^s.tep b) would have been obvious to persons having 

ordinary skill in the art in view of prior art including any 

other claim designated as corresponding to Count 2. Thus, 

Fischhoff has not established that the claims proposed to be 

amended or added define the same patentable invention as the 

count . 
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Similarly, Fischhoff has not established that proposed 
Claims 44 and 45 are patentable to Fischhoff. No claim which 
recites step b) (iii) of proposed Claims 44 and 45 has previously 
been presented in Fischhoff s U.S. Application 08/434,105 for 
interpretation by the examiner. When considered in light of the * 
supporting specification, step b) (iii) of preparing a gene 
encoding the same amino acid sequence as the gene selected 
in step (a) but which has a structural coding region with 
"lower A+T content" appears susceptible to a wide variety of 
interpretations. Thus, to satisfy its burden to show that 
proposed Claims 44 and 45 are patentable to Fischhoff, Fischhoff 
minimally should have shown that the meaning of the step b) (iii) 
would have been reasonably clear to persons having ordinary skill 
in the art in light of its supporting specification. Fischhoff 
has done no more than point to the same or substantially the same 
language in the specification. Accordingly, Fischhoff has not 
shown that it,i.s. entitled to the relief sought. Therefore, 
Fischhoff s Preliminary Motion 6 (Paper No... 83) under 37 CFR 
§ 1.633(c)(2) to redefine the interfering subject matter by 
amending Fischhof f ' s U.S. Application 08/434,105 to add new 
Claims 44 and 45 is DENIED. 
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K. Fischhoff's Preliminary Motion 10 (Paper No. 88) 
By Fischhoff's Preliminary Motion 10 (Paper No. 88), 
Fischhoff moves under 37 CFR § 1.66(c) (4) to redefine the 
interfering subject matter by designating (1) Claims 41-43 of 
Fischhoff's U.S. Application 08/434,105, and (2) Claims 13-14 of 
Adang's U.S. Patent 5,380,831 (FX 11), as not corresponding to 
the count. Adang opposes the motion (Paper No. 70). 

Each of Claims 41-43 of Fischhoff's U.S. Application 
08/434,105 is directed to "[a] modified chimeric gene comprising 
a promoter which functions in plant cells operably linked to a 
structural coding sequence and a 3 1 -nontranslated region 
comprising a polyadenylation signal which functions in plants to 
cause the addition of polyadenylate nucleotides to the 3 ! end of 
the RNA" (Paper No. 88, p. 2). For the Claim 41 chimeric gene, 
the structural coding sequence encodes an insecticidal protein 
derived from B.t. tenebrionis and comprises a structurally 
identified ^lZ9-l^.nucleotide sequence (Paper No. 88, p. 2). For 
the Claim 42 chimeric gene, the structural coding sequence 
encodes an insecticidal protein derived from B.t. entomocidus 
and comprises a structurally identified 3567 nucleotide sequence 
(Paper No. 88, p. 2) . For the Claim 43 chimeric gene, the 
structural coding sequence encodes a B.t. P2 insecticidal protein 
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and comprises a structurally identified 1905 nucleotide sequence 
(Paper No. 88, p. 2) . 

Each of Claims 13 and 14 of Adang' s U.S. Patent 5,380,831 is 

.* • 

directed to "[a] synthetic gene comprising [a particular segment 

of] the DNA sequence presented in Fig. 1" (Paper No. 88, p. 1). 

For Claim .13, the DNA segment spans nucleotides 1-1793 of the DNA 

sequence of Fig. 1. For Claim 14, the DNA segment spans 

nucleotides 1-1833 of the DNA sequence of Fig. 1. 

Claim -40 of Fischhoff's U.S. Application 08/434, 105, whose 

designation as corresponding to Count 2 Fischhoff does not 

contest, is more generally directed to (Paper No. 88, p. 1) : 

40. A synthetic gene which is derived from a 
Bacillus thuringiensis insecticidal protein toxin gene 
and which is more highly expressed in plants, wherein 
the coding sequence of said synthetic gene is modified 
to contain: 

a) a greater number of, codons preferred by the 
intended plant host than said insecticidal protein 
toxin gene; and 

b) fewer polyadenylation signal sequences than 
said ±Tise~cticidal protein toxin gene. 

Adang' s U.S. Patent 5,380,831 does not claim synthetic genes 

defined by a non-specif ically identified structural coding 

sequence. However, Claims 1 and'4 of Adang' s U.S. Patent 

6,015,891 (Appendix A) read: 

1. A synthetic Bacillus thuringiensis ( B . t . ) gene 
which is expressed in descendant plant cells and encodes 
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a pesticidal protein toxin, wherein said synthetic B . t . 
gene is produced by the process of: 

selecting a B. t . pesticidal protein toxin desired 
to be expressed in a plant cell; 

obtaining a table indicating codon usage bias for 
a gene or genes more highly expressed in a plant cell 
than a native B . t . gene; 

using said table to design a modified coding 
sequence which encodes said protein toxin, whereby said 
modified coding sequence has a frequency of codon usage 
that more closely resembles the frequency of codon usage 
of the plant cell in which it is to be expressed than 
did the native B . t . coding sequence encoding said protein 
toxin, said modified coding sequence having at least 
about 10% of the nucleotides changed as compared to the 
native B . t . coding sequence; 

obtaining a synthetic B ■ t . gene comprising a coding 
region comprising said modified coding sequence wherein 
said coding region is under the control of a plant- 
expressible promoters- 
introducing said synthetic B. t . gene into a plant 

cell; 

culturing said cell to obtain descendant plant cells 
or plants comprising descendant plant cells, said descendant 
plant cells comprising said synthetic B. t . gene; and 

establishing that said synthetic B. t . gene is 
expresS'gd^'in said descendant plant cells. 

4. A method of designing a synthetic Bacillus 
thurinaiensis ( B. t ) gene which is expressed in 
descendant plant cells, comprising the steps of: 

selecting a B. t . pesticidal protein toxin desired 

to be expressed in a plant cells- 
obtaining a table indicating codon usage bias for 

a gene or genes more highly expressed in a plant cell 

than a native B . t . gene; 
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using said table to design a modified coding 
sequence which encodes said protein toxin, whereby said 
modified coding sequence has a frequency of codon usage 
that more closely resembles the frequency of codon usage 
of the plant cell in which it is to be expressed than 
v did the native B . t . coding sequence encoding said protein 
toxin, said modified coding sequence having at least 
about 10% of the nucleotides changed as compared to the 
native B . t . coding sequence; 

obtaining a synthetic B. t . gene comprising a coding 
region comprising said modified coding sequence wherein 
said coding region is under the control of a plant- 
expressible promoter; 

introducing said synthetic B. t . gene into a plant 

cell; 

culturing said cell to obtain descendant plant 
cells, said descendant plant cells comprising said 
synthetic B. t . gene; and 

establishing that said synthetic B. t . gene is 
expressed in said descendant plant cells. 

Fischhoff argues that Claims 41-43 of Fischhoff's U.S. 

Application 08/434,105 and Claims 13 and 14 of Adang's 

U.S. Patent 5,380,831 are directed to a "separate patentable 

invention" from any of Fischhoff's and Adang's claims more 

generally directed to methods of designing synthetic genes 

derived from B. t . and synthetic genes derived from B . t . (Paper 

No. 88) . Fischhoff proffers that the specific structural DNA 

sequences defined by Fischhoff's Claims 41-43 and Adang's 13 

and 14 would not have been obvious to persons having ordinary 

skill in the art in view of prior art teachings including the 

subject matter defined by any one or all of Fischhoff's and 
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Adang's more general claims designated as corresponding to 
Count 2. To the contrary, Adang argues that the subject matter 
defined by one or more of Fischhof f ' s and Adang' s general claims 
designated as corresponding to Count 2 would have led persons 
having ordinary skill in the art to make and use any one of the 
specific synthetic structural gene sequences designed and/or 
encompassed by Fischhof f's and Adang' s more general claims 
designated as corresponding to the count with reasonable 
expectation that the synthetic B . t . gene produced would be more 
highly expressed in plants as compared to the corresponding 
unmodified native B , t . gene. Decision on Fischhof f s Preliminar 
Motion 10 is DEFERRED TO FINAL HEARING for the following reasons 

First, the evidence upon which Fischhoff and Adang may 
wish to rely in presenting their respective cases in chief for 
priority of the invention with regard to the subject matter more 
generally defined by Count 2 of this interference is likely- to 
include evidence, of conception and/or reduction to practice of 
the more limited subject matter, to which Claims 41-4 3 of 
Fischhoff's U.S. Application 08/434,105 and Claims 13 and 14 of 
Adang's U.S. Patent 5,380,831 are directed. Any evidence 
Fischhoff and Adang present for or against a conclusion that 
Claims 41-43 of its application and Claims 13-14 of Adang's 
patent are directed to separate patentable inventions from 
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any of the more general claims of the parties designated as 
corresponding to Count 2 is likely to affect the weight to be 
accorded evidence relative to specific structural gene sequences 
which Fischhoff or Adang may later present for the purpose of 
establishing that it first conceived and/or reduced to practice 
an invention encompassed by the more general scope of subject 
matter of Count 2. 

Second, if the specific structural sequences of Claims 41-43 
of Fischhoff s U.S. Application 08/434,105 and Claims 13 and 14 
of Adang' s U.S. Patent 5,380,831 are not directed to the same 
patentable invention as the generic synthetic gene sequences 
defined by Fischhoff s and Adang' s more general claims designated 
as corresponding to Count 2, then the adequacy of the written 
descriptive support for, and conception of an invention of, 
Fischhoff s and Adang' s remaining claims designated as 
corresponding to Count 2, which more generally define structural 
gene sequencg.s-,....-may be questioned. See Amgen Inc. v. Chugai 
Pharm. Co. , 927 F.2d 1200, 18 USPQ2d 1016 (Fed. Cir..) , cert- 
denied , 502 U.S. 856 (1991); Regents of the Univ. of Cal. v. 
Eli Lilly & Co. , 119 F.3d 1559, 43 USPQ2d 1398 (Fed. Cir. 1997); 
and Enzo Biochem, Inc. v. Gen-Probe Inc. , 285 F.3d 1-13, 62 
USPQ2d 1289 (Fed. Cir. 2002), vacated-in-part , reversed-in-part , 
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and remanded On Petition For Rehearing Appeal 01-1230, decided 
July 15, 2002 (slip opinion) . 

Therefore, Fischhoff's Preliminary Motion 10 (Paper No. 88) 
is DEFERRED TO FINAL HEARING. 

L. Fischhoff's Preliminary Motion 3 (Paper No. 80) 

By Fischhoff's Preliminary Motion 3 (Paper No. 80), 
Fischhoff moves under 37 CFR § 1.633(a) for a judgment that 
Claims 1-12 of Adang' s U.S. Patent 5,380,831 (FX 11) are 
unpatentable under 35 U.S.C. § 112, second paragraph. According 
to Fischhoff, the claims of Adang' s involved patent do not. 
particularly point out and distinctly claim the subject matter 
Adang regards as his invention for a variety of reasons 
considered separately below. It is essential that this motion be 
considered on the merits as early in the proceedings as possible, 
because questions related to the patentability of claimed subject 
matter under 35 U.S.C. § 112, first paragraph, § 102, and § 103, 
for examples-may not be adequately considered until the full 
scope of the claimed subject matter is clear. 

The language for which Claims 1-10 of Adang's patent are 
criticized first is the phrase "a modified sequence which 
contains a greater number of codons preferred by the intended 
plant host" (Paper No. 80, pp. 6-13) which appears in independent 
Claim 1 (FX 11) . According to Fischhoff, "there is no definition 

-80- 



Interference 103, 781 



in the '831 patent of 'codons preferred by the intended plant 
host" (Paper No. 80, p. 6, last full para.). However, in 
Mycogen Plant Sci., Inc. v. Monsanto Co. , 243 F.3d 1316, 
58 USPQ2d 1030 (Fed. Cir. 2001) (Paper No. 125), the Federal 
Circuit stated at 1326-1327, 58 USPQ2d at 1038-1039 (emphasis 
added) : 

Mycogen contests a portion of the district court's 
claim construction. Specifically, Mycogen contends that 
the district court's definition of the "greater number 
of codons preferred" language in independent claims 1, 
2, 13 and 14 of the '600 patent is erroneous. 
However, the claim construction issue here relates 
to both the '600 and the '862 patent, as well as 
the original '831 parent patent, as all three patents 
contain claims that use the language disputed herein . 
Claim 1 of the '600 patent is .representative, and it 
reads as follows: 

1. A method of designing a synthetic 
Bacillus thuringiensis gene to be more highly 
expressed in plants, comprising the steps of: 

(a) analyzing the coding sequence of a gene 
derived from a Bacillus thuringiensis which 
encodes a pesticidal protein toxin, 

(b) modifying a portion of said coding sequence 
to^yield a modified sequence which contains a 
greater number of codons preferred by the 
intended plant host than did said coding 
sequence prior to modification, said modification 
comprising reducing the number codons having 

- CG in the codon positions II and III in a 
region between plant polyadenylation signals 
in said coding sequence; 

(c) inserting said modified sequence into the 
genome of a plant cell; and 
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(d) maintaining said plant cell under conditions 
suitable to allow replication of said plant 
cell to produce additional plant cells having 
said modified sequence in the genome of said 
additional plant cells, wherein said synthetic 
Bacillus thurinqiensis gene is expressed to 
produce a pesticidal protein toxin. 

'600 patent, col. 31, lines 37-57 (emphasis added). 

In Mvcoqen , district court held that: 

[T]he phrase "greater number of codons preferred," 
is satisfied where the newly-created synthetic 
gene has- a higher number of those codons whose 
frequency in the native Bt gene was lower than 
their frequency in the intended plant host, 
and where the synthetic gene has an overall 
distribution of codon usage that is closer to 
that of the intended plant host. 

61 F.Supp.2d at 215. Thus, the district court's claim 
construction defines a "preferred codon" to be any codon 
that brings the modified Bt gene's codon frequency closer 
to that of the intended plant host. 

The Federal Circuit said, Mycoqen Plant Science Inc. v. 

Monsanto Co. , 243 F.3d at 1327, 58 USPQ2d at 1041: 

... Thus, the district court's claim construction 
regarding the "greater number of codons preferred" 
limitation-'. was correct . 

As indicated above-,- the \831 patent claim language "greater 
number of codons preferred," here criticized under 35 U.S.C. 
§ 112, second paragraph, for clarity, has been construed by both 
the Delaware district court and the Court of Appeals for the 
Federal Circuit to mean that- "the newly-created synthetic gene 
has a "higher number of those codons whose frequency in the native 
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Bt gene was lower than their frequency in the intended plant 
host, and where the synthetic gene has an overall distribution 
of codon usage that is closer to that of the intended plant 
host" based on a common patent specification. Mvcoaen Plant 
Sci., Inc. v. Monsanto Co. , 243 F.3d at 1326-1327, 58 USPQ2d 
at 1038-1039. Therefore, Fischhoff's present motion under 37 CFR 
§ 1.633(a) for judgment that Claims 1-10 of Adang' s U.S. Patent 
5,380,831 (FX 11) are unpatentable under 35 U.S.C. § 112, second 
paragraph, because the phrase "a modified sequence which contains 
a greater number of codons preferred by the intended plant host" 
(FX 11, Claim 1) does not particularly point out or distinctly 
claim the subject matter Adang regards as its invention must be 
denied. The scope of those claims of Adang' s U.S. Patent 
5,380,831 (FX 11) which are limited by that criticized language 
has been determined and is clear. 

Second, Fischhoff argues that the phrases "the intended 
plant host'^Qf^Claim 1 and "the plant in which it is to be 
expressed" of Claim 11 of Adang' s U.S. Patent. - 5, 380, .831 
themselves render Claims 1-12 of Adang's U.S. Patent 5,380,831 
vague and indefinite (Paper No. 80, pp. 14-18) . The argument 
is based on Fischhoff's perception that skilled artisans 
would interpret the word "plant" in the two phrases to refer to 
"the whole plant," an interpretation which is inconsistent with 
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statements in the specification of Adang's U.S. Patent 5,380,831 

(Paper No. 80, p. 14) . For example, immediately following the 

"Frequency Of Codon Usage" listing in Adang' s Table 1 (Adang's 

U.S. Patent 5,380,831, Table 1, col. 18), there is stated 

(emphasis added) : 

154 coding sequences of dicot nuclear genes were used 
to compile the codon usage table. The pooled dicot 
coding sequences, obtained from Genbank (release 55) 
or, when no Genbank file is specified, directly from 
the published source .... 

Thus, the step of "modifying a portion of said coding sequence to 

yield a modified sequence which has a frequency of codon usage 

which more closely resembles the frequency of codon usage of 'the 

plant in which it is to be expressed'" of Claim 11 of Adang's 

U.S. Patent 5,380,831 would appear to read on the step of 

"modifying a portion of said coding sequence, to yield a modified 

sequence which has a frequence of codon usage which is more 

closely resembles the frequency of codon usage of [the nuclear 

genes] 'of .,t,fcie^plant in which it is to be expressed'." According 

to Fischhoff , the frequency of codon usage for the .whole plant is 

likely to differ markedly from the frequency of codon usage of 

the plant's nuclear genes. More particularly, Fischhoff notes 

from Adang's disclosure that "chloroplasts . . . have their own 

genome with different codon frequencies than are founds in 

nuclear genes" (Paper No. 80, p. 14, para. 2). See Adang's U.S. 
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Patent 5,380,831, col. 26, 1. 27-49. In addition, Fischhoff 
argues that the frequency of codon usage varies considerably not 
only from one type of plant to another and from one type of 
tissue to another for the same plant, but also within each cell 
itself (Paper No. 80, p. 14). 

Neither the Delaware district court nor the Court of Appeals 
for the Federal Circuit appear to have interpreted the term 
"plant" in the claims of Adang's patents. Presuming that the 
plain meaning of the term is "the whole plant," the fact that 
Adang's patent specification preferably modifies a portion of the 
coding sequence "to yield a modified sequence which contains a 
greater number of codons preferred by the intended plant host" 

(Claim 1 of Adang's U.S. Patent 5,380,831) based on the Frequency 
of Codon Usage Distribution Fractions compiled from 154 coding 
sequences of nuclear genes does not itself render the subject 
matter claimed vague and indefinite. Adang's specification 
teaches (Adang's U.S. Patent 5,380,831, col. 7, 1. 11-16) 

(emphasis added) : 

[T]he frequency of preferred codon usage exhibited by 
a host cell can be calculated by averaging frequency 
of preferred codon usage in a large number of genes 
expressed by the host cell. It is preferable that 
this analysis be limited to genes that are highly 
expressed by the host cell . 

Again, Adang's patent teaches, "[T]o optimize the efficiency of 

translation, codons preferred in highly expressed proteins of 
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the host cell are utilized" (Adang's U.S. 5,380,831, col. 9, 

1. 52-54). Therefore, it is not surprising that Adang's 

patent specification discloses, as its preferred embodiment, 

modification of a portion of the coding sequence "to yield a 

modified sequence which contains a greater number of codons 

preferred by the intended plant host" (Claim 1 of Adang's U.S. 

Patent 5,380,831) based on the Frequency of Codon Usage 

Distribution Fractions compiled from 154 coding sequences of 

nuclear genes. Claim 11 of Adang's patent specification reflects 

a preference for "modifying a portion of said coding sequence to 

yield a modified sequence which has a frequency of codon usage 

which more closely resembles the frequency of codon usage of the 

plant in which it is to be expressed" (Claim 11 of Adang's U.S. 

Patent 5,380,831). Adang's patent specification teaches (Adang's 

U.S. 5,380,381, col. 26, 1. 18-20 and 46-47): 

In general, the plant codon usage pattern 
more closely resembles that of man and other higher 
eukarYOt:es.,than unicellular organisms. .... 



. . . In general, the chloroplast codon profile 
more closely resembles that of unicellular organisms 

Adang teaches that the codon bias of its nuclear genes best 
reflects the codon bias of a host plant, the whole plant. Adang 
teaches that the codon bias of genes which are highly expressed 

-86- 



Interference 103,781 

in the host plant best reflect the codon bias of the whole plant. 
While the methods of Adang' s patented claims modify a coding 
sequence to yield a modified sequence which contains a greater 
number of codons preferred in the intended plant host, Adang' s 
specification teaches that the codon preference of the whole 
plant is best represented by the codon preferences of nuclear 
genes which are highly expressed in the host plant. 

Third, .while Adang' s patent specification does not 
explicitly define "analyzing the coding sequence of a gene 
derived from Bacillus thuringiensis which encodes an insecticidal 
protein toxin" (Claims 1 and 11 of Adang' s U.S. 5,380,831), it is 
clear that the native B . t . coding sequence must be analyzed to an 
extent necessary for one skilled in the art to modify a portion 
of the native B . t . coding sequence to yield a modified sequence 
which either contains a greater number of codons preferred by the 
intended plant host than did the native B. t . coding sequence 
analyzed or^feas-.-a frequence of codon usage which more closely 
resembles the frequency of codon usage of the plant in which it 
is to be expressed. There is nothing in the second paragraph of 
35 U.S.C. § 112 which indicates that a broad step of a patentable 
method claim cannot be defined by its function as long as the 
claim .language as a whole apprizes persons skilled in the art of 
the full scope of the method claimed. While Fischhoff prima 
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facie criticizes the term "analyzing" in the claimed methods as 
broad enough to encompass any kind of analysis which allows 
persons skilled in the art to modify a portion of the B. t . coding 
sequence as indicated, Fischhoff has not explained why the term 
itself does not satisfy 35 U.S.C. § 112, second paragraph, if the 
purpose of the analysis is clear (Paper No. 80, pp. 18-19) . 
There is nothing inherently wrong with broad claim language. 

Fourth, Fischhoff does not argue that the U.S. District 
Court. for the District of Delaware, U.S. District Court for the 
Southern District of California, or the Court of Appeals for the 
Federal Circuit misinterpreted the methods of Claims 1-12 of 
Adang's U.S. Patent 5,308,831 comprising the steps, of "analyzing 
. . . " and "modifying . . ."as further limited by the functional 
phrase "of designing a synthetic Bacillus thurinaiensis gene to 
be more highly expressed in plants" (Paper No. 80, pp. 19-21). 
On its face, each of the methods of Claims 1 and 11 of Adang's 
U.S. Patent_^., t J.80, 831 comprises two steps which, upon performance 
thereof, are said to effectively design a synthetic Bacillus 
thurinaiensis gene which is more highly expressed in a plant than 
an unmodified native B. t . gene is expressed in the same plant. 
Presuming the steps of the claimed methods adequately define the 
scope and content of the subject matter claimed, how persons 
skilled in the art would measure comparatively higher expression 

-88- 



Interference 103,781 

of synthetic B. t . genes made by the methods of Claims 1-12 of 

Adang' s patent appears to be immaterial to the patentability of 

the claims. Fischhoff has not shown that performance of the two 

steps of each of the claimed methods would not inherently design 

a synthetic B ■ t ■ gene which is more highly expressed in plants 

than the corresponding native B . t . gene. 

Fifth, supported by declaratory evidence accompanied by 

data, Fischhoff argues that the phrase "codons preferred by the 

intended plant host'' in Claim 1 of Adang' s U.S. Patent 5, 380, 831 

is vague and indefinite because the determinations whether codons 

are preferred by the intended plant host depend on the kind and 

number of genes used to determine the codons preferred by the 

plant host and the statistical differences associated with each 

determination (Paper No. 80, pp. 21-22) . Even assuming that 

Fischhof f ' s argument has merit, Fischhoff has not shown that' 

the scope of the claimed subject matter is vague, indefinite, 

and indeterminable. The Delaware district court held: 

[T]he phrase "greater number of codons preferred," 
is satisfied where the newly-created synthetic 
gene has a higher number of those codons whose 
frequency in the native Bt gene was lower than 
their frequency in the intended plant host, 
and where the synthetic gene has an overall 
distribution of codon usage that is closer to 
that of the intended plant host. 

Mvcoaen Plant Sci., Inc. v. Monsanto Co, , 61 F. Supp.2d 199, 215 

(D. Del. 1999) . On review, the Court of Appeals for the Federal 
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Circuit stated, "[T]he district court's claim construction 

regarding the 'greater number of codons preferred' limitation was 

correct." Mycogen Plant Sci., Inc. v. Monsanto Co. , 243 F.3d 

at 1330, 58 USPQ2d at 1041. 

Sixth, Fischhoff argues that the phrase "a frequency 

of codon usage which more closely resembles the frequency 

of codon usage of the plant in which it is to be expressed" 

of Claims 11-12 of Adang' s U.S. Patent 5,380,831 is itself 

indefinite (Paper No. 80, pp. 23-27) .' Initially, Fischhoff asks 

what the phrase "more closely resembles" means? However, the 

Federal Circuit indicated that the Delaware district court 

correctly construed the phase in context as follows: 

[I]t would . . . result in a higher number of those 
codons whose frequency in the native Bt gene was lower 
than their frequency in the intended plant host. 

Id^, 243 F.3d at 1329, 58 USPQ2d at 1041. Fischhoff has not 

explained why the Delaware district court's literal 

interpretation of the quoted claim language is unreasonable. 

Fischhoff acknowledges (Paper No. 80, p. 27, first full para.): 

Another reasonable interpretation of claims 11 
and 12 is that' any one amino acid's codon distribution, 
If changed in the right direction, causes the sequence 
to have a frequency of codon usage which more closely 
resembles the frequency of codon usage in the host. 
On that basis, Dr. Barry concluded that a prior art 
Bt fusion gene disclosed in DeGreve et al. (which 
contained a greater proportion of CCC's (proline) 
than the native Bt gene did) fully meets the literal 
language of the party Adang et al.'s claims .... 
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Fischhoff has not shown that the language of Claims 1-12 of 
Adang's U.S. Patent No. 5,380,831 is vague, indefinite, or 
indeterminable. Fischhoff has not shown that persons skilled in 
the art reasonably would not have understood the scope of the 
subject matter claimed in Adang's patent. Accordingly, 
Fischhoff ' s Preliminary Motion 3 (Paper No. 80) is DENIED . 

M. Fischhoff' s Preliminary Motion 5 (Paper No. 82) 

By Fischhoff s Preliminary Motion 5 (Paper No. 82), 
Fischhoff moves under 37 CFR § 1.633(a) for judgment that 
Claims 1-12 of Adang's U.S. Patent 5,380,831 are unpatentable 
under 35 U.S.C. § 112, first paragraph, as based on a 
specification which would not have enabled one skilled in the art 
to make and use the full scope of the invention claimed. This 
motion is DEFERRED TO FINAL HEARING . 

Substantial portions of Fischhoff s arguments in support of 
its position that the subject matter defined by Claims 1-12 of 
Adang's U.S^Patent 5, 380,831 is broader than the enabling 
disclosure are devoted to the confusion persons skilled in the 
art would have had trying to determine the scope and content of 
the claimed subject matter. As Fischhoff did in its Preliminary 
Motion. 3 (Paper No. 80), Fischhoff argues that the phrases "to 
be more highly expressed in plants" in Claims 1-12 of Adang's 
patent (Paper No. 82, pp. 5-6), "which contains a greater number 
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of codons preferred by the intended plant host" in Claims 1-10 of 
Adang's patent (Paper No. 82, pp. 6-17), and "to yield a modified 
sequence which has a frequency of codon usage which more closely 
resembles the frequency of codon usage of the plant in which it 
is to be expressed" in Claims 11-12 of Adang's patent (Paper, 
No. 82, pp. 17-25), are inadequately defined in the patent's 
specification. Because the phrases purport to be vague and 
indefinite, Fischhoff argues that persons skilled in the art 
would have been required to perform undue experimentation to 
determine how to modify a coding sequence derived from a native 
B.t. gene which encodes an insecticidal protein toxin to be more 
highly expressed in plants. Presumably then, since Fischhoff s 
Preliminary Motion 3 (Paper No. 80) has been denied, this motion 
also should be denied. Not necessarily so. 

In its revised post-trial opinion of September 8, 1999 
( Mvcogen Plant Sci., Inc. v, Monsanto Co. , 61 F. Supp. 2d 199 
(D. Del. 199g,)J : .,, the Delaware district court considered the 
issue whether the claims of Adang's U.S. Patents 5,567,600 and 
5,567,862 are invalid for lack of enablement (Paper No. 125, 
Attachment H, pp. 127-135) . Much of the language of Claims 1-12 
and the disclosure of the specification of Adang's U.S. Patent 
5,380,831 upon which Fischhoff here bases its nonenablement 
arguments are common to the claims and specifications of Adang's 
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U.S. Patents 5,567,600 and 5,567,862. The Delaware district 

court held that the claims of Adang' s U.S. Patents 5,567,600 and 

5,567,862, which include further limitations, are invalid for 

lack of enablement (Paper No. 125, Attachment H, pp. 127-135) . 

The court stated (Paper No. 125, Attachment H, pp. 134-135) : 

Adang and Murray have identified a specific 
methodology for designing a gene by modifying codon 
sequences to reduce certain codons and have set out in 
their specification an example showing how they have 
implemented that methodology to design a synthetic 
Bt gene. In claiming their invention, they have, 
identified the methodology, but have not identified 
which codons or how many should be removed. In light 
of the number of these codons in a gene, there are 
millions and even billions of ways to implement their 
methodology as claimed, almost all of which will not 
achieve the desired result. The example Adang and 
Murray have set out [in (sic)] the specification may 
show an example of how they have implemented their 
methodology, but neither the example nor the 
specifications provides guidance to those skilled in 
the art on how the methodology should be implemented. 
That is, the claims and specifications do not identify 
which and how many codons should be removed. 

Having reviewed and considered the matter, the 
court finds the defendants (Monsanto. Co.) have offered 
clear and convincing evidence which establishes that 
the specifications of the '600 and '8 62 patents are 
not enabling. The court will, therefore, grant the 
defendants motions and enter an order directing the 
clerk to enter judgment in favor of defendants and 
against plaintiff on defendants' counterclaims that 
the claims of the patent are invalid for lack of 
enablement . 

Thereafter, having the issue whether the claims of Adang' s 
U.S.. Patent 5, 380,831 are supported by an enabling disclosure 
before it on appeal from a summary judgment based on the 
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Delaware district court's decision that the narrower claims of 

the '600 and '861 patent are not enabled by their supporting 

specifications, the Federal Circuit remanded the issue to the 

U.S. District Court for the Southern District of California, 

stating in Mvcogen Plant Sci., Inc. v. Monsanto Co. , 252 F.3d 

1306, 1317, 58 USPQ2d 1891, 1899 (Fed. Cir. 2001): 

Monsanto' s argument on appeal is that the claims are 
too broad to be enabled by a specification that provides 
only one example of an embodiment of the invention. The 
specification of the '831 patent, however, includes more 
than just one example: it contains codon usage tables, 
recommendations, on the preferred level of homology, 
and means for calculating deviation of the frequency of 
preferred codon usage. The proper resolution of the 
enablement issue is thus not sufficiently for us to 
direct the district court to enter summary judgment 
on that issue before the district court has addressed 
it. Accordingly, we leave it to the district court to 
determine whether there is a genuine issue of material 
fact as to enablement. 

The issue here is whether Fischhoff has sustained its burden 
to show that the specification of Adang' s U.S. Patent 5,380,831 
would not have enabled persons skilled in the art to make and use 
the full scope'" of the subject matter defined by its Claims 1-12 
without undue experimentation. To decide the issue, we must 
consider all evidence of record indicating whether or not 
modification of a portion of a coding sequence of a gene derived 
from Bt which encodes an insecticidal protein toxin to yield a 
modified sequence which (1) contains a greater number of codons 
preferred by the intended plant host than did the portion of a 
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coding sequence of a native gene derived from Bt which encodes an 
insecticidal protein toxin, and/or (2) has a frequency of codon 
usage which more closely resembles the frequency of codon usage 
of the plant in which it is to be expressed, necessarily designs 
a synthetic Bt gene encoding an insecticidal protein toxin which 
is more highly expressed in plants than a native Bt gene encoding 
an insecticidal protein toxin. It is anticipated that the 
evidence and arguments supporting the parties' cases for priority 
of invention will shed more light on the issue. Accordingly, 
Fischhoff's Preliminary Motion 5 (Paper No. 82) is DEFERRED TO 
FINAL HEARING . 

N. Fischhoff's Preliminary Motion 9 (Paper No. 87) 
By Fischhoff's Preliminary Motion 9 (Paper No. 87), 
Fischhoff moves under 37 CFR § 1.633(a) for judgment that 
Claims 1-12 of Adang's U.S. Patent 5,380,831 are unpatentable 
under 35 U.S. C. § 112, first paragraph, for noncompliance with 
its written description requirement. Fischhoff's arguments in 
support of this motion are best represented by the following 
quotations: 

Claim 1 of the '831 patent recites that the 
modified gene sequence "contains a greater number 
of codons preferred by the intended plant host than 
did said coding sequence [subjected to analysis]." 
The M82 application specification does not contain 
'any reference to or definition of the term "greater 
number of codons preferred by the intended plant 
host." In the absence of any definition of this 
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term, the ordinary meaning of the term "greater 
number of codons" would include any number of 
codons including only one codon. 

(Paper No. 87, p. 12, second full para.); 

Claim 11 of the y 831 patent contains mention 
of "frequency of codon usage" with no mention of 
"preferred" as a characteristic of codon usage. 
The M82 application specification does not contain 
any reference to or definition of the term "frequency 
of codon usage." 

(Paper No. 87, p. 13, first full para.); 

It was not until the amendment dated June 16, 1993 
that applicants Adang et al. added claim 58, among 
others, which recited "a greater number of codons 
preferred by the intended plant host than did said 
coding sequence." New claim 58 was substantially 
identical to claim 1 of the '831 patent. The amendment 
cites no specification support for these new claims, 
but it asserts the following: 

As was discussed during the May 28 interview, 
simply replacing unfavored codons of the native 
Bacillus thuringiensis ( B. t . ) gene with more 
favored codons of the intended plant host 
can often effect a number of the other 
modifications suggested by applicants in the 
specification . 

(Paper No. 87, p. 13, last para.; footnote omitted); 

Adang et al. admitted to what their invention actually 
was in an amendment dated June 30, 1994, during 
prosecution of the x 191 application [ (Adang' s U.S. 
Application 08/057,191)]: 

Applicants discovered that a range of sequence 
characteristics that are undesirable for expression 
in plant hosts could be minimized or eliminated 
by modifying the B. t . gene sequence to reflect 

frequency of codon usage more closely resembling 
that of the intended plant host . [Emphasis added.] 
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Claims 1-10, in contrast, read on making just one 
codon substitution. Claim 12 is even broader in one 
respect because it reads on making just one nucleotide 
substitution and claim 11 is still broader because 
it reads on modifications without any nucleotide , 
■> substitutions at all. 

(Paper No. 87, p. 17; footnote omitted); 

The specification must reasonably convey to one 
skilled in the relevant art that Adang et al. had 
possession of a method for enhancing expression of a 
Bt gene by making a single nucleotide or codon change 
at the time its application was filed. 

Indeed in the office action dated December 11, 
1990, in the M82 application, the examiner noted that 
claims directed to any single factor which was modified 
were not enabled, because modifying any single factor 
was unlikely to produce a useful construct. The 
various factors disclosed by the party Adang et al. 
as modifications which would produce a more highly 
expressed gene included the elimination of CUUGG 
hairpins, CG and TA doublet avoidance, the elimination 
of polyadenylation sequences, the elimination of 
polymerase II termination sequences, and the 
elimination of plant consensus splice sites. The 
party Adang et al. not only failed to provide any 
data supporting the operability of any such single 
modified sequence, but it also failed to note written 
description support in the specification for making 
only one type of modification. Clearly, Adang et al. 
could not point to support in the specification for 
a singi'fc'type of modification comprising a change of 
only one nucleotide. 

(Paper No. 87, pp. 21-22; footnote omitted); and 

. . . [I]n response to the same type of undue 
breadth rejection, the party Adang et al. submitted a 
declaration by Michael G. Murray, Ph.D., which stated: 

The most important consideration is to 
more closely approximate the frequency of 
codon usage of the intended plant host while 
avoiding sequences known to be undesirable 
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to plants. It is obvious that there would be 
a great number of possibilities for modifying 
a native B . t . sequence while accomplishing 
these goals. However, so long as these goals 
are accomplished, the fact that the resulting 
* synthetic sequences differ' would not cause the 

ordinary skilled artisan to doubt that each 
synthetic gene would work as taught by the 
Adang application. 

While it is true that there would be a great number of 
possibilities for modifying a native B. t . gene according 
to the general teachings in the '831 patent, Dr. Murray's 
argument implies that multiple changes are needed to 
"accomplish those goals." In the absence of evidence 
that a small or specific region in the gene is particularly 
important to modify, one would not expect that just one or 
even several modifications would accomplish these goals. 

(Paper No. 87, pp. 22-23; footnote omitted). 

Accordingly, Fischhoff argues (Paper No. 87, p. 24) : 

[T]he specification of the M82 application does not 
reasonably convey to one skilled in the relevant art 
that the applicants Adang et al. had possession of a 
method for modifying Bt sequences comprising a single 
nucleotide or codon change, at the time that the M82 
application was filed. Because they did not have 
support for the recitation of "greater number of 
codons" (claim 1) and "wherein the modification step 
comprises the substitution of at least one nucleotide," 
(claim 12) each of claims 1-10 and 12 is invalid under • 
the first"'* paragraph of 36 (JSC 112. 

Fischhoff has not satisfied its burden to show that it is 
entitled to the relief sought. Fischhoff s motion is denied for 
the reasons stated below. 

Fischhoff criticizes Adang' s written description of the 
invention claimed in terms more pertinent to determinations that 
claims are unpatentable because the full scope of the subject 
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matter claimed therein would not have been enabled by the 
supporting specification. Note that Fischhoff argues that 
"Claim 12 is even broader . . . because it reads on making just 
one nucleotide substitution and claim 11 is still broader because 
it reads on modifications without any nucleotide substitutions at 
all" (Paper No. 87, p. 17); "the examiner noted that claims 
directed to any single factor which was modified were not 
enabled, because modifying any single factor was unlikely to 
produce a useful construct" (Paper No. 87, p. 21); "Adang et al. 
not only failed to provide any data supporting the operability of 
any such single modified sequence, but it also failed to note 
written description support in the specification for making 
only one type of modification. Clearly, Adang et al. could not 
point to support in the specification for a single type of 
modification comprising a change of only one nucleotide" (Paper 
No. 87, pp. 21-22); and, "In the absence of evidence that a 
small or spegiflc region in the gene is particularly important 
to modify, one would not expect that just one or even several 
modifications would accomplish these goals" (Paper No. 87, 
p. 23) . 

On consideration of a movant's case for unpatentability 
under 35 U.S.C. § 112, first paragraph, its written description 
requirement and its enablement requirement should not be 
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confused. The court stated in Vas-Cath, Inc. v. Mahurkar , 
935 F.2d 1555, 1563-1564, 19 USPQ2d 1111, 1116-1117 (Fed. Cir. 
1991) : 

This court in fin rel Wilder f. 736 F.2d 1516, 
1520, 222 USPQ 369, 372 (Fed. Cir. 1984), cert, denied , 
469 U.S. 1209 (1985),] (and the CCPA before it) clearly 
recognized, and we hereby reaffirm, that 35 USC 112, 
first paragraph, requires a "written description of 
the invention" which is separate and distinct from the 
enablement requirement. The purpose of the "written 
description" requirement is broader than to merely 
explain how to "make and use"; the applicant must also 
convey with reasonable clarity to those skilled in the 
art that, as of the filing date sought, he or she was 
in possession of the invention . The invention is, for 
purposes of the "written description" inquiry, whatever 
is now claimed . 

Thus, a specification may convey that it was in possession of a 
broadly claimed invention and satisfy the written description 
requirement of 35 U.S.C. § 112, first paragraph, without 
describing all species the broadly claimed invention encompasses. 
Utter v. Hiracra . 845 F.2d 993, 998, 6 USPQ2d 1709, 1714 (Fed. 
Cir. 1988). "[T]hat a claim may be broader than the specific 
embodiment -disclosed in a specification is in itself of no 
moment." In re Rasmussen , 650 F.2d 1212, 1215, 211 USPQ 323, 326 
(CCPA 1981) . The description requirement of the first paragraph 
of 35 U.S.C. § 112 is satisfied if one skilled in the art would 
have understood from the specification as .a whole that the 
broader claim language defines an invention disclosed. Id. 
at 1216, 211 USPQ at 327, citing In re Smvthe , 480 F.2d 1376, 
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1384, 178 USPQ 279, 285 (CCPA 1973). How close the written 

description must be to the language of the claimed subject 

matter- to comply with § 112 is determined on a case-by-case 

basis. In re Smith , 458 F.2d 1389, 1395, 173 USPQ 679, 683 

(CCPA 1972) . Each case must be decided on its own facts. In 

Driscoll . 562 F.2d 1245, 1250, 195 USPQ 434, 438 (CCPA 1977). 

In this case, Adang's patent specification teaches: 

The invention disclosed herein comprises a chemically 
synthesized gene encoding an insecticidal protein which 
is functionally equivalent to a native insecticidal 
protein of Bt. This synthetic gene is designed to be 
expressed in plants at a level higher than a native 
Bt gene. . . . Preferably, the synthetic gene is at 
least approximately 85% homologous to an insecticidal 
protein gene of Bt. 

(U.S. Patent 5,380,831, col. 3, 1. 56-64); 

In designing synthetic Btt genes of this invention 
for enhanced expression in plants, the DNA sequence of 
the native Btt structural gene is modified in order to 
contain codons preferred by highly expressed plant genes, 
to attain an A+T content in nucleotide base composition 
substantially that found in plants, and also preferably 
to form a plant initiation sequence, and to eliminate 
sequences that cause destabilization, inappropriate 
polyadetfylStion, degradation and termination of RNA and 
to avoid sequences that constitute secondary structure 
hairpins and RNA splice sites. In the synthetic genes, 
codons used to specify a given amino acid are selected 
with regard to the distribution frequency of codon usage 
employed in highly expressed plant genes to specify that 
amino acid. As is appreciated by those skilled in the 
art, the distribution frequency of codon usage utilized 
in the synthetic gene is a determinant of the level of 
expression . Hence, the synthetic gene is designed such 
that its distribution frequency of codon usage deviates, 



-101- 



Interference 103, 781 



preferably, no more than 25% from that of highly expressed 
plant genes and, more preferably, no more than about 10%. 

(U.S. Patent 5,380,831, col. 4, 1. 3-25; emphasis added); 

A structural gene may contain one or more modifications 
in either the coding or the untranslated regions which 
could affect the biological activity or the chemical 
structure* of the expression product, the rate of 
expression or the manner of expression control. 
Such modifications include, but are not limited to, 
mutations, insertions, deletions and substitutions 
of one or more nucleotides . 

(U.S. Patent 5,380,831, col. 5, 1. 66, to col. 6, 1. 5; emphasis 

added) ; 

The synthetic Bt genes of the present invention are not 
considered. to be functionally equivalent to native Bt 
genes, since they are expressible at a higher level in 
plants than native Bt genes. 

(U.S. Patent 5,380,831, col. 6, 1. 64-68); 

When synthesizing a gene for improved expression 
in a host cell it is desirable to design the gene such 
that its frequency of codon usage approaches the 
frequency of preferred codon usage of the host cell. 

(U.S. Patent 5,380,831, col. 7, 1. 25-28; emphasis added); 

The term designed to be highly expressed as used 
he re in* defers to a level of expression of a designed 
gene wherein the amount of its specific mRNA transcripts' 
produced is sufficient to be quantified in Northern 
blots and, thus, represents a level of specific mRNA 
expressed corresponding to greater than or equal to 
approximately 0.001% of the poly (A) + mRNA. To date, 
natural Bt genes are transcribed at a level wherein 
the amount of specific mRNA produced is insufficient 
to be estimated using the Northern blot technique. 

(U.S. Patent 5,380,831, col. S, 1. 8-17); 
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Thus, this invention is based on the recognition 
that expression levels of desired, recombinant 
insecticidal protein in transgenic plants can be 
improved via increased expression of stabilized mRNA 
transcripts; and that, conversely, detection of these 
stabilized RNA transcripts maybe utilized to measure 
expression of translational product (protein) . 

(U.S. Patent 5,380,831, col. 8, 1. 54-60); and 

Experimental evidence obtained from point mutations 
and deletion analysis has indicated that in eukarvotic 
genes specific sequences are associated with post- 
transcriptional processing, RNA destabilization , 
translational termination, intron' splicing and the 
like . These are preferably employed in the synthetic 
genes of this invention. In designing a bacterial gene 
for expression in plants, sequences which interfere with 
the efficacy of gene expression are eliminated. 

(U.S. Patent 5,380,831, col. 10, 1. 11-19; emphasis added). 

While the above-quoted portions from Adang's U.S. Patent 

5,380,831 indicate that point mutations in the native Bt coding 

sequence for insecticidal protein and/or a synthetic modification 

of the Bt coding sequence for insecticidal protein wherein the 

modification comprising the substitution and/or deletion of one 

nucleotide jji ,~the native Bt coding sequence may design a 

synthetic Bt gene to be more highly expressed in a plant, 

Fischhoff questions whether Adang's disclosure would have led 

persons skilled in the, art to understand that Adang's invention 

comprised a method of designing a native Bt coding sequence to be 

more, highly expressed in a plant comprising a point mutation 

and/or substitution or deletion of one nucleotide. A finding 
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that persons skilled in the art would have believed that, at the 

time its patented application was filed, Adang's invention 

comprised a method of designing native Bt coding sequence to be 

more highly expressed in a plant comprising a point mutation 

and/or substitution or deletion of one nucleotide, is supported 

by the following passages (Adang's U.S. Patent 5,380,831, 

col. 12, 1. 43, to col. 12, 1. 21; emphasis added): 

In designing a synthetic gene for expression in 
plants, attempts are also made to eliminate sequences 
which interfere with the efficacy of gene expression. 
Sequences such as the plant polyadenylation signals, 
. . . polymerase II termination sequence, . . . 
UCUUCGG hairpins and plant consensus splice sites 
are highlighted and, if present in the native Btt 
coding sequence, are modified to eliminate potentially 
deleterious sequences . 

Modifications in nucleotide sequence of the Btt 
coding region are also preferably made to reduce the 
A+T content in DNA base composition. . . . Since A+T- 
rich regions typify plant intergenic regions and 
plant regulatory regions, it is deemed prudent to 
reduce the A+T content 

Also, a single modification (to introduce 
guanine in lieu of adenine) at the fourth nucleotide 
positiorf'Tir the Btt coding sequence is made in the 
preferred embodiment to form a sequence consonant 
with that believed to function as a plant initiation 
sequence (Taylor et al. (1987) Mol. Gen. Genet. 210: 
572-577) in optimization of expression . . . . 

Not all of the above-mentioned modifications 
of the natural Bt gene must be made in constructing 
a synthetic Bt gene in order to obtain enhanced 
expression . For example, a synthetic gene may be 
synthesized for other purposes in addition to that 
of achieving enhanced levels of expression. Under 
these conditions, the original sequence of the 
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natural Bt gene may be preserved within a region 
of DNA corresponding to one or more, but not all, 
segments used to construct the synthetic gene. 
Depending on the desired purpose of the gene, 
modification may encompass substitution of one 
or more, but not all, of the oligonucleotide 
segments used to construct the synthetic gene 
by a corresponding region of natural Bt sequence. 

Adang's patent specifies (U.S. Patent 5,380,831, col. 15, 

1. 30-55) : 

This invention combines the specific teachings of 
the present disclosure with a variety of techniques and 
expedients known in the art. The choice of expedients 
depends on variables such as the choice of insecticidal 
protein from a Bt strain, the extent of modification 
in preferred codon usage, manipulation of sequences 
considered to be destabilizing to RNA or sequences 
prematurely terminating transcription, insertions of 
restriction sites within the design of the synthetic 
' gene to allow future nucleotide modifications, additions 
of introns or enhancer sequences to the 5 ! and/or 3 1 ends 
of the synthetic gene, the promoter region, the host in 
which a promoter region/structural gene combination is 
expressed, and the like. . . . The fundamental aspect 
of the present invention is the ability to synthesize 
a novel gene coding for an insecticidal protein, 
designed so that the protein will be expressed at an 
enhanced level in plants, yet so that it will retain 
its inherent property of insect toxicity and retain 
or increase its specific insecticidal activity. 

Persons skilled in the art would have understood from the 

teaching of Adang's patent specification as a whole that the 

•invention described therein minimally comprises a single 

modification of a native Bt gene encoding a insecticidal protein, 

e.g., the introduction, deletion or substitution at a single 
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nucleotide position, to enhance expression of the insecticidal 
protein in a plant by a detectable amount of mRNA transcripts. 

The more debatable issue is, as Fischhoff suggests, whether 
Adang's patent specification would have enabled persons skilled 
in the art to make and use the full scope of the invention 
described and claimed. That issue is deferred to final hearing. 
Fischhoff has not otherwise shown that Adang's patent 
specification does not satisfy the written description of 
35 U.S.C. § 112, first paragraph, for the subject matter claimed. 
Accordingly, Fischhoff s Preliminary Motion 9 (Paper No. 87) is 
DENIED . 

0. Fischhoff s Preliminary Motion 7 (Paper No. 85) 
By Fischhoff s Preliminary Motion 7 (Paper No. 85), 
Fischhoff moves under 37 CFR § 1.633(a) for judgment that 
Claims 1-12 of Adang's U.S. Patent 5,380,831 (FX 11), issued 
January 10, 1995, are unpatentable under 35 U.S.C. § 102 or 
§ 103 (Paper ^N-O...,. 85, p. 1) . Consideration of this motion is 
deferred to final hearing. 

When Fischhoff filed its motion (Paper No. 85), Adang 
opposed the motion (Paper No. 68), and Fischhoff replied (Paper 
No. 107), neither party to this interference appears to have had 
the benefit of the interpretations of terms in its claims by 
the U.S. District Courts for the District of Delaware and the 
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Southern District of California and the decisions of the U.S. 
Court of Appeals for the Federal Circuit in review thereof. It 
is improper to consider the patentability of claimed subject 
matter under the first paragraph of 35 U.S.C. § 112, 35 U.S.C. 
§ 102, and 35 U.S.C. § 103 before the scope and content of the 
subject matter claimed is understood. See Panduit Corp. v. 
Dennison Mfg. , 810 F.2d 1561, 1567-1568, 1 USPQ2d 1593, 1597 
(Fed. Cir.), cert, denied , 481 U.S. 1052 ( 1987 )( footnote 
omitted) : 

Analysis begins with a key legal question - what 
is the invention claimed ? Courts are required to view 
the claimed invention as a whole . 35 U.S.C. § 103. 
Claim interpretation, in light of the specification, 
claim language, other claims, and prosecution history, 
is a matter of law, and will normally control the 
remainder of the decisional process. 

See also In re Geerdes , 491 F.2d 1260, 1262, 180 USPQ 789, 791 

(CCPA 1974) (" [b] efore considering the rejections under 35 U.S.C. 

103 and 112, we must first decide [what] . . . the claims include 

in their scope"); In re Moore , 439 F.2d 1232, 1235, 169 USPQ 236, 

238 (CCPA 1971) ("the claims must be analyzed first in order to 

determine exactly what subject matter they encompass") ; and In re 

Steele , 305 F.2d 859, 862, 134 USPQ 292, 295 (CCPA 1962): 

We do not think a rejection under 35 U.S.C. 103 should 
be based on . . . speculations and assumptions. We 
think [it] . . . wrong in relying on what at best are 
speculative assumptions as to the meaning of the claims 
and basing a rejection under 35 U.S.C. 103 thereon. 
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Along with filing this motion, Fischhoff moved under 37 CFR 

§ 1.633(a) for judgment that the Adang's claims designated as 

corresponding to the count do not distinctly claim and 

particularly define the subject matter Adang regards as its 

invention (Fischhoff s Preliminary Motion 3 (Paper No. 80)). 

Therein, Fischhoff argued that the subject matter defined by 

Adang's claims is vague and indefinite. Considering that view, 

it must be assumed that the respective- positions of the parties 

supporting and opposing this motion are based on speculation. 

Accordingly, it is most appropriate, informative, and efficient 

at this time to defer consideration of the issues presented by 

this motion until after the parties have' had ample opportunities 

to comprehend the Federal Circuit's review of district courts' 

decisions interpreting language common to, and the scope and 

content of, the parties' claims designated as corresponding to 

the count, and reconsider and support their respective positions 

in that light. 

Another key preliminary legal inquiry is - what 
is the prior art? ["Under 35 U.S.C. § 103, the scope 
and content of the prior art are to be determined; 
differences between the prior art and the claims at 
issue are to be ascertained; and the level of skill in 
the pertinent art resolved. Against this background, 
the obviousness or nonobviousness of the subject matter 
is determined." Graham v. John Deere Co. , 383 U.S. 1, 
17-18 (1966).] Before answering Graham' s "content" 
inquiry, it must be known whether a patent or 
publication is in the prior art under 35 U.S.C. 
§ 102, - a legal question. 
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Panduit Corp. v. Dennison Mfg. . 810 F.2d at 1568, 1 USPQ2d 
at 1597. The primary purpose of an interference proceeding is to 
determine priority of a commonly claimed invention, i.e., what 
constitutes prior art under 35 U.S.C. § 102(g). Accordingly, the 
issue whether Adang' s claims are unpatentable under 35 U.S.C. 
§ 103 cannot and should not be decided until the full scope and 
content of the prior art have been determined; the differences 
between the prior art and the claims at issue have been 
ascertained; and the level of skill in the pertinent art is 
resolved. Accordingly, Fischhoff Preliminary Motion 7 (Paper 
No. 85) is DEFERRED TO FINAL HEARING . 

P. Fischhoff 's 37. CFR § 1.641(a) Request (Paper No. 110) 
By Fischhoff s 37 CFR § 1.641(a) Request (Paper No. 110), 
Fischhoff asks the APJ "to set a time period within which each 
party may develop testimony and present arguments and evidence 
on the apparent invalidity of" Claims 1-12 of Adang' s U.S. 
Patent 5, 38Q y .,8.3.1. under 35 U.S.C. § 112, first paragraph, for 
noncompliance with its best mode requirement (Paper No. 110, 
pp. 6-7, bridging para.). As support therefor, Fischhoff argues 
that the APJ, exercising discretion which is permissible under 
37 CFR § 1.641(a), should "conclude" that Claims 1-12 of Adang' s 
U..S. Patent 5, 380,831 are invalid for noncompliance with the 
best mode requirement of 35 U.S.C. § 112, first paragraph. 
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According to Fischhoff, it is "apparent" from the evidence that 
Adang's patent specification does not satisfy the best mode 
requirement of 35 U.S.C. § 112, first paragraph (Paper No. 110, 
pp. 6-7, bridging para.). Nevertheless, given that Fischhoff 
itself has not moved under 37 CFR 1.633(a) for judgment that 
Claims 1-12 of Adang's U.S. Patent 5,380,831 are unpatentable 
under 35 U.S.C. § 112, first paragraph, it appears that (1) the 
evidence to which Fischhoff points may not be sufficient to 
establish the invalidity of Claims 1-12 of Adang's U.S. 
Patent 5,380,831 under 35 U.S.C. § 112, first paragraph (best 
mode requirement), and (2) Fischhoff itself does not consider the 
evidence sufficient to establish invalidity under 35 U.S.C. 
§ 112, first paragraph. Indeed, Fischhof f ' s request is 
confusing . 

In Eli Lilly & Co. v. Barr Labs. , 251 F.3d 955, 58 USPQ2d 

1869 (Fed. Cir. 2001), the court said at 963, 58 USPQ2d at 1874: 

Our case law explicating the best mode requirement 
f ocuses''W''a two-prong inquiry. . . . First, the 
factfinder must determine whether, at the time of 
filing the application, the inventor possessed a best 
mode for practicing the invention. . . . Second, if 
the inventor possessed a best mode the factfinder must 
determine whether the written description disclosed 
the best mode such that one reasonably skilled in the 
art could practice it. . . . The first prong involves 
a subjective inquiry, focusing on the inventor's state 
of mind at the time of filing. . . . The second prong 
involves an objective inquiry, focusing on the scope 
of the claimed invention and the level of skill in the 
art. . . . 
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With respect to the second prong of the best mode 
requirement, the extent of information that an inventor 
must disclose depends on the scope of the claimed 
invention 

In this case, it appears that the synthetic genes comprising DNA 

sequences of Figure 1 of Adang's U.S. Patent 5,380,831 which span 

nucleotides 1 through 1793 and 1 through 1833 represent specific 

examples of useful products which can be made by Adang's 

generally claimed method of designing a synthetic Bt gene to be 

more highly expressed in plants. However, Adang's specification 

makes clear that useful products made in accordance with the 

claimed invention are much broader in .scope than the patent 

specification's examples, and the synthetic Bt genes comprising 

the DNA sequences depicted in Figure 1 and defined by Claims 13 

and 14 of Adang's patent do not necessarily represent the best 

mode. of designing a synthetic Bt gene to be more highly expressed 

either in plants in general or in any particular kind of plant. 

Adang's patent specification teaches (Adang's U.S. Patent 

5, 380, 831, -col? ""15, 1. 30-55): 

This invention combines the specific teachings of 
the present disclosure with a variety of techniques and 
expedients known in the art. The choice of expedients 
depends on variables such as the choice of insecticidal 
protein from a Bt strain, the extent of modification 
in preferred codon usage, manipulation of sequences 
considered to be destabilizing to RNA or sequences 
prematurely terminating transcription, insertions of 
restriction sites within the design of the synthetic 
gene to allow future nucleotide modifications, addition 
of introns or enhancer sequences to the 5' and/or 
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3' ends of the synthetic structural gene, the promoter 
region, the host in which a promoter region/structural 
gene combination is expressed, and the like. As novel 
insecticidal proteins and toxic polypeptides are 
discovered, and as sequences responsible for enhanced 
■>■ cross-expression (expression of a foreign structural 

gene in a given host) are elucidated, those of ordinary 
skill will be able to select among those elements to 
produce "improved" synthetic genes for desired proteins 
having agronomic value. The fundamental aspect of the 
present invention is the ability to synthesize a novel 
gene coding for an insecticidal protein, designed so 
that the protein will be expressed at an enhanced level 
in plants, yet so that it will retain its inherent 
property of insect toxicity and retain or increase 
its specific insecticidal activity. 

From the above quotation from Adang' s patent, it is not at 
all apparent that Adang possessed a best mode generally for 
practicing the claimed invention at the time its patent 
application was filed. It is even less apparent that the 
invention Adang claims can have a best mode without specifying 
the type of plant in which the synthetic Bt gene is to be highly 
expressed, the particular extent and/or kind of insect 
infestation, the particular toxin the Bt gene is to encode, the 
particular -amount and/or strength of the toxin the plant must 
produce relative to the kind of infestation and plant, etc. . 
Given the number of expedients influencing the design of the 
synthetic Bt gene to be produced by the methods generally claimed 
in Adang's patent, it is not apparent that Adang possessed a best 
mode of practicing the method generally claimed. 
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Moreover, in its broadest aspects, the methods claimed in 
Adang' s patent specify that the native Bt gene is to be modified 
to yield a modified sequence which contains a greater number of 
codons preferred by the intended plant host" (Claim 1 of Adang' s 
U. S. Patent 5,380,831) or to have a frequency of codon usage 
more closely resembling the frequency of codon usage of the plant 
in which it is to be expressed (Claim 11 of Adang' s U. S. Patent 
5,380,831). It is not apparent that the examples in Adang' s 
patent do not set forth the best mode of carrying out the subject 
matter Adang most broadly claims to be its invention. 

Since it is not apparent from the evidence to which 
Fischhoff points that Claims 1-12 of Adang' s patent are 
unpatentable under 35 U.S.C. § 112, first paragraph (best mode 
requirement) , there is no reason for the APJ to exercise 
discretion permissible under 37 CFR § 1.641(a) and "conclude" 
that Claims 1-12 of Adang' s U.S. Patent 5,380,831 are invalid for 
noncompliance,., .with the best mode requirement of 35 U.S.C. § 112, 
first paragraph, or- "to set a time period within which each party 
may develop testimony and present arguments and evidence . . . 
[relative to] the apparent invalidity of" Claims 1-12 of Adang' s 
U.S. Patent 5,380,831 under 35 U.S.C. § 112, first paragraph, for 
noncompliance with its best mode requirement. Fischhoff s 37 CFR 
§ 1.641(a) Request (Paper No. 110) is DENIED . 
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Q. Fischhoff s Motion Under 37 CFR § 1.635 
For Temporary Stay Of Interference 
Under 37 CFR S 1 . 645 (d) ( Paper No. 118) 

Fischhoff moves for a temporary stay of this interference 

proceeding pending the decision of the U.S. District Court for 

the Southern District of California in Mvcoaen Plant Sci., Inc., 

and Agriqenetic's, Inc. v. Monsanto Co. , Case No. 95-0653 j, on 

Monsanto' s motion for summary judgment that all claims of Adang' s 

U.S. Patent 5,380,831 are invalid under 35 U.S.C. §§ 102(g) and 

103 because of prior invention thereof by Fischhoff. In that 

case, Mycogen Plant Science Inc. and Agrigenetics Inc. sued 

Monsanto Company for infringement of its patent (Adang et al., 

U.S. Patent 5,380,831, issued January 10, 1995, from U.S. 

Application 08/057,191, filed May 3, 1993). The U.S. District 

Court for the Southern District of California entered an order 

( Mvcoaen Plant Sci., Inc. v. Monsanto - Co . , No. 95-CV-653 (S.D. 

Cal. Nov. 10, 1999) (Paper No. 127, Exh. A)) granting defendant's 

motion for nummary judgment that Claims 1-12 of Mycogen' s '831 

patent are invalid under 35 U.S.C. § 102(g) and/or § 103 because 

Monsanto invented the subject matter thereof before Mycogen, as 

determined by the U.S. District Court for the District of 

Delaware in Mvcoaen Plant Sci., Inc. v. Monsanto Co. , 61 F. Supp. 

2d 199 (D. Del. 1999), which was affirmed in Mvcoaen Plant Sci . , 

Inc. v. Monsanto Inc. . 243 F.3d 1316, 58 USPQ2d 1030 (Fed. Cir. 
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2001), and denied defendant's motion for summary judgment that 

the contested claims of Mycogen' s '831 patent are invalid for 

noncompliance with the enablement requirement of the first 

paragraph of 35 U.S.C. § 112 as moot (Paper No, 127, Exh. A). 

On appeal from the decision of the U.S. District Court for 

the Southern District of California on motion for summary 

judgment in Mvcoaen Plant Sci., Inc. v. Monsanto Co. . 

No. 95-CV-653 (S.D. Cal. Nov. 10, 1999) ( Paper No . 127, Exh. A), 

the U.S. Court of Appeals for the Federal Circuit affirmed-in- 

part, reversed-in-part, and remanded. Mvcoaen Plant Sci., Inc. 

v. Monsanto Co. , 252 F.3d 1306, 1309, 58 USPQ2d 1891, 1892-1893 

(Fed. Cir. 2001).. The Federal Circuit concluded at 1309, 

58 USPQ2d at 1893, that: 

. . . the district court improperly resolved disputed 
questions of material fact pertaining to the issue of 
prior invention, and we therefore reverse the court's 
ruling on summary judgment that the '831 patent is 
invalid under 35 U.S.C. § 102(g). We decline to affirm 
the summary judgment of invalidity on the alternative 
ground of non-enablement , as urged by Monsanto, but 
leave 'tS'^fre district court the task of determining in 
the first instance whether there is a genuine issue of 
material fact as to enablement based on its assessment 
of the evidence presented to it in the summary judgment 
proceeding. 

Id^ at 1310, 58 USPQ2d at 1894, the Federal Circuit explained: 

We agree with the district court that collateral 
estoppel requires the court to conclude that Monsanto 
reduced the invention [claimed in the Mycogen's '831 
patent] to practice before Mycogen, and that collateral 
estoppel does not resolve the question whether Mycogen 
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was the first to conceive and then was diligent during 
the critical period. On the merits of the summary 
judgment question, however, we do not agree that 
Monsanto has met its burden of showing that there 
are no issues of material fact regarding whether 
Mycogen was the first to conceive the invention and 
then diligently reduce it to practice. 

Because Fischhoff moves for a temporary stay of this 

interference proceeding pending the decision of the U.S. District 

Court for the Southern District of California in Mycogen Plant 

Sci . , -Inc., and Aariaenetics , Inc. v. Monsanto Co. , Case 

No. 95-0653j, on Monsanto' s motion for summary judgment that 

all claims of Adang's U.S. Patent 5,380,831 are invalid under 

35 U.S.C. §§ 102(g) and 103 because of prior invention thereof 

by Fischhoff, the decision of the U.S. District Court for the 

Southern District of California on Monsanto' s motion for summary 

judgment that all claims of Adang's U.S. Patent 5,380,831 are 

invalid under 35 U.S.C. §§ 102(g) and 103 was entered 

November 10, 1999, and the district court's ruling on summary 

judgment that,, ..the '8 31 patent is invalid under 35 U.S.C. 

§ 102(g)/103 was reversed on appeal to the U.S. Court of Appeals 

for the Federal Circuit in* Mycogen Plant Sci., Inc. v. Monsanto 

Co^, 252 F.3d 1306, 1309, 58 USPQ2d 1891, .1892-1893 (Fed. Cir. 

2001), Fischhoff s motion for a temporary stay of this 

interference proceeding pending the decision of the U.S. District 

Court for the Southern District of California in Mycogen Plant 
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Sci., Inc., and Aarigenetics , Inc. v. Monsanto Co. , Case 

No. 95-0653 j, on Monsanto' s motion for summary judgment that 

all claims of Adang's U.S. Patent 5,380,831 are invalid under 

35 U.S.C. §§ 102(g) and 103 because of prior invention thereof by 

Fischhoff (Paper No. 118) is DISMISSED . 

R. Fischhoff s Motion Under 37 CFR § 1.635 
For Temporary Stay Of Interference 
Under 37 CFR § 1.645(d) Pending Outcome 
Of Federal Circuit Appeal Of The District 
Court Judgment Of Invalidity Of Claims Of 
The Involved Adang Patent Based On Prior 
Invention By Fischhoff (Paper No. 127) 

The decision of the U.S. District Court for the Southern 

District of California on Monsanto' s motion for summary judgment 

that all claims of Adang's U.S. Patent 5,380,831 are invalid 

under 35 U.S.C. §§ 102(g) and 103 was entered November 10, 1999, 

and the district court's ruling on summary judgment that the A 831 

patent is invalid under 35 U.S.C. § 102(g)/103 was reversed on 

appeal to the U.S. Court of Appeals for the Federal Circuit in 

Mvcooen Plant Sci., Inc. v. Monsanto Co. , 252 F.3d 1306, 

58 USPQ2d 1891 (Fed. Cir. 2001). Thus, Fischhof f ' s motion for a 

temporary stay is moot. Fischhoff s motion (Paper No. 127) is 

DISMISSED . 

4 . Summary of Decisions on Outstanding Requests and Motions 

A. Adang Preliminary Motion 1 (Paper No. 45) 
DENIED ; 

B. Adang Preliminary Motion 2 (Paper No. 4 6) 
DISMISSED ; 
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C. Adang Preliminary Motion 3 (Paper No. 47) 
DENIED ; 

D. (1) Fischhoff First Rule 642 Request (Paper No. 78) 

DISMISSED ; 

(1) Fischhoff Second Rule 642 Request (Paper No. 79) 
DISMISSED ; 

E. Fischhoff Preliminary Motion 4 (Paper No. 81) 
DENIED ; 

F. Fischhoff Preliminary Motion 8 (Paper No. 86) 
DISMISSED ; 

G. Fischhoff Preliminary Motion 11 (Paper No. 89) 
DISMISSED ; 

H. Fischhoff Preliminary Motion 12 (Paper No. 60) 
DISMISSED ; 

I. Adang' s Miscellaneous Rule 635 Motion (Paper No. 116) 
GRANTED ; 

J. Fischhoff Preliminary Motion 6 (Paper No. 83) 
DENIED ; 

K. Fischhoff Preliminary Motion 10 (Paper No. 88) 

DEFERRED TO FINAL HEARING ; 
L. Fischhoff Preliminary Motion 3 (Paper No. 80) 

DENIED ; 

M. Fischhoff Preliminary Motion 5 (Paper No. 82) 

DEFERRED TO FINAL HEARING ; 
N. Fischhoff Preliminary Motion 9 (Paper No. 87) 

DENIED ; 

0. Fischhoff Preliminary Motion 7 (Paper No. 85) 

DEFERRED TO FINAL HEARING ; 
P. Fischhoff Rule 641 Request (Paper No. 110) 

DENIED ; 

Q. Fischhoff Miscellaneous Rule 635 Motion (Paper No. 118) 
DISMISSED ; 

R. Fischhoff Miscellaneous Rule 635 Motion (Paper No. 118) 
DISMISSED. 
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5 . Order 

This interference has been redeclared as Barton (U.S. 
Application 07/827,906) or Fischhoff (U.S. Application 
08/434,105) v. Adana (U.S. Patent 5,380,831) . New Count 2 
defines the interfering subject matter: 

Count 2 

Any one of Claims 1-4, 7, and 15-22 of Barton et al.'s 
Application 07/827,906, filed January 30, 1992; 

- or - 

Any one of Claims 3, 5, and 39-43 of Fischhoff et al.'s 
Application 07/827,906, filed January 30, 1992; 

- or - 

Any one of Claims 1-14 of Adang et al.'s 

U.S. Patent 5,380,831, which issued January 10, 1995, 
from U.S. Application 08/057,191, filed May 3, 1993. 

The claims of the parties which have been designated as 

corresponding to this count are: 

Barton: Claims 1-4, 7, and 15-22 

Fischhoff-:..., Claims 3, 5, and 39-43 

Adang: Claims 1-14. . 

In addition, this Decision and Order on Preliminary and 
Miscellaneous Motions and Reguests invites the parties to 
consider the relationship of the subject matter defined by 
Count 2 of this interference to subject matter claimed in 
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Mycogen's U.S. Patents 6,013,523 and 6,015,891 and comment 
thereon . 

Accordingly, it is 

ORDERED that no later than thirty (30) days from the date of 
this Decision and Order on Preliminary and Miscellaneous Motions 
and Requests, the parties to this interference shall specify 
whether the time for filing preliminary motions should be 
extended; 

FURTHER ORDERED that no later than thirty (30) days from the 
date of this Decision and Order on Preliminary and Miscellaneous 
Motions and Requests, the parties to this interference shall 
specify what additional preliminary motions, if any, and 
supporting evidence, if any, need be filed in this newly 
declared interference; 

FURTHER ORDERED that no later than thirty (30) days from the 
date of this Decision and Order on Preliminary and Miscellaneous 
Motions and^JRequests, the parties to this interference shall 
explain why the additional preliminary motions and supporting 
evidence specified are necessary to, and should be filed in, this 
interference proceeding, and 

FURTHER ORDERED that no later than thirty (30) days from the 
date of this Decision and Order on Preliminary and Miscellaneous 
Motions and Requests, the parties to this interference shall 
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recommend time periods for filing the specified additional 
preliminary motions, supporting evidence, oppositions, replies, 
motions to suppress evidence, etc. 



Teddy S. C^/on 

Administrative Patent Judge 
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Lawrence M. Lavin, Jr., Esq. 
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Lawrence "M. Lavin, Jr., Esq. 
Monsanto Company 
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COUNSEL FOR SENIOR PARTY ADANG et al. 

Oliver R. Ashe, Jr., Esq. 
Greenblum & Bernstein, P.L.C. 
1941 Roland Clarke Place 
Reston, VA 20191 



Interference 103,781 



APPENDIX A 



-123- 



United States Patent u 9] 

Adang et al. 



IMIlIlllUllHl 

US006015891A 

[ii] Patent Number: 6,015,891 
[45] Date of Patent: *Jan. 18, 2000 



[54] SYNTHETIC INSECTICIDAL CRYSTAL 
PROTEIN GENE HAVING A MODIFIED 
FREQUENCY OF CODON USAGE 

[75] Inventors: Michael J. Adang, Athens, Ga.; 

Elizabeth E. Murray, Madison, Wis. 

[73] Assignee: Mycogen Plant Science, Inc., San 

Diego, Calif. 

[ * ] Notice: palent ^ sub j ect lo a lermma i dis- 

claimer. 

[21] Appl. No.: 08/705,438 
[22] Filed: Aug. 29, 1996 

Related VS. Application Data 

[60] Division of application No. 08/369,835, Jan. 6, 1995, Pat. 
No. 5,567,600, which is a continuation-in-part of application 
No. 08/057,191, May 3, 1993, Pat. No. 5,380,831, which is 
a continuation of application No. 07/827,844, Jan. 28, 1992, 
abandoned, which is a continuation of application No! 
07/242,482, Sep. 9, 1988, abandoned. 

[51] Int. CI. 7 C12N 15/00; C12N 15/32; 

C12N 15/82 

[52] U.S. CI 536/23.71; 435/440; 435/468 

[58] Field of Search 536/23.71; 435/172.3, 

435/320.1, 69.1, 419, 468, 440; 800/205, 

279, 302 

[56] References Cited 

U.S. PATENT DOCUMENTS 

4,356,270 10/1982 Itakura 435/317 

4,888,282 12/1989 Beremand et al 435/193 

5,254,799 10/1993 De Greve 800/205 

5,276,268 1/1994 Strauch et al 800/205 

5,380,831 1/1995 Adang et al 536/23.71 

5,567,600 10/1996 Adang et al 536/23.71 

FOREIGN PATENT DOCUMENTS 
0305275 3/1989 European Pat. Off. . 

OTHER PUBLICATIONS 

Sekar, V. et al. (1987) "Molecular cloning and characteriza- 
tion of the insecticidal crystal protein gene of Bacillus 
thuringiensis var. tenebrionis" Pro. Natl. Acad. Sci. USA 
84:7036-7040. 

McPherson, S. A. et al. (1988) "Characterization of the 
Colopteran-specific protein gene of Bacillus thuringiensis 
var. Tenebrionis" Biotechnology 6(l):61-6\ 
Vaeck, M. et al. (1987) "Transgenic Plants protected from 
insect attach" Nature 328:33-37. 

Murray, E. E. et al. (1989) "Codon usage in plant genes" 
Nucleic Acids Research 17(2):477-498. 
Perlak, F. J. et al. (1991) "Modification of the coding 
sequence enhances plant expression of insect control protein 
genes" Proc. Natl. Acad. Sci. USA 88:3324-3328. 
Hernastadt et al. (1986) Biotechnology 4:305-308. 
Sekar et al. (1987) Proc. Natl. Acad. Sci. 84:7036-7040. 
McPherson et al. (1988) Biotechnology 6:61-66. 



Vaeck et al. (1987) Nature 328:33-37. 

Barton el al. (1987) Plant Physiol. 85: 1103-1109. 

Hofie et al. (1988) FEBS Lett. 226:364-370. 

Bennetzen and Hall (1982) J. Biol. Chero. 257:3026-3031. 

Hoekema et al. (1987) Mol. Cell. Biol! 7:2914-2924. 

Boudraa (1987) Genet. Sel. Evol. 19:143-154. 

Grantham et al. (1986) Oxford Surveys in Evol. Biol. 

3:48-81. 

Taylor et al. (1987) Mol. Gen. Genet. 210:572-577. 
Joshi (1987) Nucl. Acids Res. 15:9627-9640. 
Vankan and Fillipowicz (1988) Embo J. 7:791-799. 
Tuerk et al. (1988) Proc. Natl. Acad. Sci. 85:1364-1368. 
Krieg et al. (1983) Z. Angew. Entomol. 96:500-508. 
Bernahard (1986) FEMS Microbiol. Lett. 33:261-265. 
Neili et al. (1987) Gene 55:303-317. 
Rothstein et al. (1987) Gene 55:353-356. 
Coraggio et al. (1986) EMBO J. 5:459-465. 
Fuzakawa et al. (1987) FEBS Lett. 224:125-127. 
Vies et al. (1986) EMBO J. 5:2439-2444. 
Gatenby et al. (1987) Eur. J. Biochem. 168:227-231. 
Viotti et al. (1978) Biochim. Biophys. Acta 517:125-132. 
Hames and Higgins (cds) (1985) Nucleic Acid Hybridisa- 
tion, IRL Press, Oxford, UKe. 

Caruthers, M. (1983) In Methodology of DNA and RNA 
Sequencing, Chapter 1, Weissman (ed.), Praeger Publishers, 
New Yor.I. 

Lo et al. (1984) Proc. Natl. Acad. Sci. 81:2285-2289. 
Grantham et al. (1985) Bull. inst. Pasteur 83:95-148. 
Matsuoka et al. (1987) J. Biochem. 102:673-676. 
Mandecki et al. (1985) Proc. Natl. Acad. Sci. 82:3543-3547. 
Feretti et al. (1986) Proc. Natl. Acad. Sci. 83:599-603. 
Paszkowski, J. et al. (1984) EMBO J. 3: 2717. 
Frbmm, M. et al. (1985) Proc. Natl. Acad. Sci. USA 
82:5824. 

Crossway, A. et al. (1986) Mol. Gen. Genet. 202:179. 
Hooykaas-Van Slogteren, G. et al. (1984) Nature 311:763. 
Dandekar, A. et al. (1987) Biotechnology 5:587. 
An, G. et al. (1985) EMBO J. 4:277. 
Herrera-Estrella, L. et al. (1983) Nature 303:209E. 
Herrera-Estrella, L. et al. (1983) EMBO J. 2:987. 
Herrera-Estrella, L. et al. (1985) in Plant Genetic Engineer- 
ing, Cambridge University Press, New York, p. 63. 
asadaban, M. J. and Cohen, S.N. (1980) J. Mol. Biol. 
138:179-207. 

Marsh, J. L. et al. (1984) Gene 32:481^*85. 
Kronstad et al (1983) J. Bacteriol. 154: 419-428. 
Adang et al. (1987) in Biotechnology in Invertebrate Pathol- 
ogy and Cell Culture, K. Maramorosh (ed.) Academic Press, 
Inc. New York, pp 85-99. 

(List continued on next page.) 

Primary Examiner— Lynette R. F. Smith 
Assistant Examiner — Amy J. Nelson 
Attorney, Agent, or F/r/w-^Saliwanchik, 
Saliwanchik 



Lloyd & 



[57] 



ABSTRACT 



Synthetic Baccilus thuringiensis toxin genes designed to be 
expressed in plants at a level higher than naturally-occurring 
Bt genes are provided. These genes utilize codons preferred 
in highly expressed monocot or dicot proteins. 
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CODING REGION OF SYNTHETIC BH GENE 
DIVIDED INTO 13 SEGMENTS 

SEGMENT A 

J SEGMENT M 
SEGMENT AM 

| SEGMENT BL 
SEGMENT ABLM 

| SEGMENT CK 
SEGMENT ABCKLM 

| SEGMENT DJ SEGMENT F 





SEGMENT ABCDJKLM 
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SEGMENT G 
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FIG. 2 
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SYNTHETIC INSECTICIDAL CRYSTAL and determined the nucleotide sequence. This crystal protein 

PROTEIN GENE HAVING A MODIFIED gene was contained on a 5.9 kb BamHI fragment 

FREQUENCY OF CODON USAGE (pNSBF544). A subclone containing the 3 kb Hindlll frag- 
ment from pNSBF544 was constructed. This HindHI frag- 

CROSS REFERENCES TO RELATED 5 ment contains an open reading frame (ORF) that encodes a 

APPLICATIONS 644-amino acid polypeptide of approximately 73 kDa. 

tu * • a- • • c i- ^ „ Extracts of both subclones exhibited toxicity to larvae of 

Jan 6 199S C ° l0rad ° beetle ^ptinotarsa delemlineata, a 

Jan 6, 1995, issued as U.S Pat No. 5,567,600, which is a coleopteran insect). 73- and 65-kDa peptides that cvosz- 

TtZTfZT ° f /PP f h ™ l C ^" / ? 57 'J 91 J * —ted with an antiserum against thecal pro Tno^l 
filed May 3, 1993 issued as U.S Pat. No. 5,380,831, which tenebrionis were produced on expression in E coli Sporu- 

Z\^linr toT. mS?2^ W S^ q " TZT U ° f 73 " kDa pe P lide thal co ^Ponds to the expected product 
XS^I No. 07/242,482, filed Sep^ 9, 1988, now fr om the ORF of pNSBP544. However, isolated crystals 
abandoned, all of winch are incorporated herein by refer- ^ primarily contain a ^-kDa component / When me crystal 

" Ce * protein gene was shortened at the N-terminal region, the 

FIFI n OF THP TMVPNrnnM dominant protein product obtained was the 65-kDa peptide, 

nci^r irttiiNViiiNJiuiN A deletion derivative, P 544Pst-Met5, was enzymatically 

This invention relates to the field of bacterial molecular derived from the 5.9 kb BamHI fragment upon removal of 
biology and, in particular, to genetic engineering by recom- 20 fo rtv - s ix amino acid residues from the N-terminus. Expres- 
binant technology for the purpose of protecting plants from sion of * ne N-terminal deletion derivative, p544Pst-Met5, 
insect pests. Disclosed herein ar ethe chemical synthesis of resulted in the production of, almost exclusively, the 65 kDa 
a modified crystal protein gene from Bacillus thuringiensis P ro tein. Recently, McPherson et al. (1988) Biotechnology 
var. tenebrionis (Bit), and the selective expression of this 6:61-66 demonstrated that the Btt gene contains two func- 
synthetic insecticidal gene. Also disclosed is the transfer of 25 lional translational initiation codons in the same reading 
the cloned synthetic gene into a host microoorganism, frame . fading to the production of both the full-length 
rendering the organism capable of producing, at improved P rote "i and an N-terminal truncated form, 
levels of expression, a protein having toxicity to insects. Chimeric toxin genes from several strains of Bt have been 

This invention facilitates the genetic engineering of bacteria expressed in plants. Four modified Btl genes from var. 
and plants to attain desired expression levels of novel toxins 30 ber liner 1715, under the control of the 2' promoter of the 
having agronomic value. Agrobacterium TR-DNA, were transferred into tobacco 

plants (Vaeck et al. (1987) Nature 328:33-37). Insecticidal 
BACKGROUND OF THE INVENTION levels of toxin were produced when truncated genes were 

r ri,rm„ m ™c,v ib*\ , expressed in transgenic plants. However, the steady state 

Hnrino th * f ] m ltS ablUty 10 Pr0dUCe ' 35 mRNA levels in the transgenic plants were so low that they 

dunng the process of Ration protemaceous, crystalline could not be reliaW ^ ^ £ St ^Zl^S^ 
inclusions which are found lo be hiehlv toxic to several u - c _■ • • . " ,ulc,u uiul a ™iysis ana 

• „„, f . „ . . several nence were n Uantl fi eu usmg ribonuclease protection exoeri- 

insect pests of agricultural importance. The crystal Droteins „, D , J Km , . , u«"»pui«.uuuMij,n 
of HifWh. Rt JUinc h,„» , .1. u . P rolcm * ments. Bt mRNA levels in plants producing the h ghest level 

of different B/ strains have a rather narrow host range and of tein corresponded \ 0 _o<x>o 1% th e poi y(A r 
hence are used commercially as very selective biological mRNA P°'WV 
insecticides. Numerous strains of Bt are toxic to lepi- i ,u . u u , . 

dopteran and dip.eran insects. Recently two subspecies (or h KP ° V V ( 1987 > su P ra > «P««io«« of 
varieties) of Bt have been reported to be pathogenic to Cb,meriC &™ ^ conta " Mn S the cntire cod »>S ^"ce of Btl 
coleopteran insects: var. tenebrionis (Krieg e. al. (1983) Z. ZL3 '° L ho 5*, A con,ain,D g *2 genes. 

Angew. En.omol. 96:500-508) and var san dLo J^r-ii some T-DNA constructs included a chimeric 

(Herrnstadt et al. (1986) Biotechnol. 4:305-308) Both 45 ^ ". a j™ 1 "* selectable in plants, whereas other 

strains produce flat, rectangular crystal inclusions and have ^'Tnd TZ^Z T ^ , betwee " fra g™ n * 
a major crystal component of 64-68 kDa (Herrnstadt e. al ° f ^ a " d NFm ^ J°f cllcidal levek ° f to *"> w «e 
supra; Bernhard (1986) FEMS Microbiol Lett P roduced truncated Btl genes or fusion constructs 

33:261-265) were ex P ressed 111 transgenic plants. Greenhouse grown 

rp • , , . . , „ so plants produced -O.02% of the total soluble protein as the 

cJZSr b0m * Ven \ ^ ot f have be™ toxin, or 3 /<g of toxin per g. fresh leaf tissue^nd, even a. 
cloned and the recombinant clones were found to be toxic to five-fold lower levels, showed 100% mortality in six^lay 

coleopteran-acuve toxm genes have also been isolated and cou i d be obtained using the intact Bt2 coding sequence, 

fr?Jr^, FnT™ V 3 5 8 kb BamHI 55 des P' ,e me fact lhat ,he - sarac P rom »'" was used .o direct its 

fragmen of Bt var. san diego DNA The protein expressed expression. Intact B/2 protein and RNA yields in the trans- 

»LZ ', ^ r leafbee.le) and had ge „ic plant leaves were 10-50 times lower than those for the 

a molecular weigh, of approximately 83 kDa. This 83 kDa lru „ C ated B,2 polypeptides or fusion proteins. 

r 0 4 p s P :,if„ r - zs^z 'r,s „ Barton et ^ li r p,am physio1 - 85:iio3 - no9 showed 

may oe syntnesizea as a larger precursor molecule that is kb gene (encoding a 645 amino acid protein followed bv two 

being formed fnto a c^taf. ^ ^ * * «° ^ ^ ^ 3 ^ («> VHW 

„ / sequence. Under these conditions expression was observed 

aA -ZtZ-ZLJt ..I " 87) NaU Acad - Sci - USA 65 fo^/mRNAallevelsupto47pg/20 //g RNAandl2 n g/mg 

8^7036-7040 ; U S. patent apphcation Sen No. 108,285, plant protein. This amount of Bt protein in plant tissue 
tiled Oct. 13, 1987 isolated (he crystal protein gene from Btt produced 100% mortality in two days. This level of expres- 
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sion still represents a low level of mRNA (2.5xl0" 4 %) and native Btt structural gene is modified in order to contain 

protein (1.2x10'- %). codons preferred by highly expressed plant genes, to attain 

Various hybrid proteins consisting of N-terminal frag- an A+T con lent in nucleotide base composition substantially 
ments of increasing length of the Btl protein fused to NPTII that found in plants, and also preferably to form a plant 
were produced in E. coli by Hofte el al. (1988) FEBS Lett. 5 initiation sequence, and to eliminate sequences that cause 
226:364-370. Fusion proteins containing the first 607 amino deslabilization, inappropriate polyadenylation, degradation 
acids of Btl exhibited insect toxicity; fusion proteins not and termination of RNA and to avoid sequences that con- 
containing this minimum N-terminal fragment were non- stitute secondary structure hairpins and RNA splice sites. In 
toxic. Appearance of NPTII activity was not dependent upon the synthetic genes, codons used to specify a given amino 
the presence of insecticidal activity; however, the confer- 10 acid are selected with regard to the distribution frequency of 
mation of the Btl polypeptide appeared to exert an important codon employed in highly expressed plant genes to 
influence on the enzymatic activity of the fused NPTII specify that amino acid. As is appreciated by those skilled in 
protein. This study did suggest that the global 3-D structure the art, the distribution frequency of codon usage utilized in 
of the Btl polypeptide is disturbed in truncated polypep- the synthetic gene is a determinant of the level of expression. 
tlc * es * 15 Hence, the synthetic gene is designed such that its distribu- 

A number of researchers have attempted to express plant tion frequency of codon usage deviates, preferably, do more 

genes in yeast (Neill et al. (1987) Gene 55:303-317; Roth- than 25% from that of highly expressed plant genes and, 

stein et al. (1987) Gene 55:353-356; Coraggio et al. (1986) more preferably, no more than about 10%. In addition, 

EMBO J. 5:459^65) and E. coli (Fuzakawa et al. (1987) consideration is given to the percentage G+C content of the 

FEBS Lett. 224:125-127; Vies et al. (1986) EMBO J. 20 degenerate third base (monocotyledons appear to favor G+C 

5:2439-2444; Gatenby et al. (1987) Eur. J. Biochem. in this position, whereas dicotyledons do not). It is also 

168:227-231). In the case of wheat a-gliadin (Neill et al. recognized that the XCG nucleotide is the least preferred 

(1987) supra), a-amylase (Rothstein et al. (1987) supra) codon in dicots whereas the XTA codon is avoided in both 

genes, and maize zein genes (Coraggio et al. (1986) supra) monocots and dicots. The synthetic genes of this invention 

in yeast, low levels of expression have been reported. Neill 25 also preferably have CG and TA doublet avoidance indices 

et al. have suggested that the low levels of expression of as defined in the Detailed Description closely approximating 

a-gliadin in yeast may be due in part to codon usage bias, those of the chosen host plant. More preferably these indices 

since a-gliadin codons for Phe, Leu, Ser, Gly, Tyr and deviate from that of the host by no more than about 10-15%. 

especially Glu do not correlate well with the abundant yeast Assembly of the Bt gene of this invention is performed 

isoacceptor tRNAs. In E. coli however, soybean glycinin A2 30 using standard technology known to the art. The Btt struc- 

(Fuzakawa et al. (1987) supra) and wheat RuBPC SSU (Vies tural gene designed for enhanced expression in plants of the 

et al. (1986) supra; Gatenby et al. (1987) supra) are specific embodiment is enzymatically assembled within a 

expressed adequately. DNA vector from chemically synthesized oligonucleotide 

Not much is known about the makeup of tRNA popula- duplex segments. The synthetic Bt gene is then introduced 

tions in plants. Viotti et al. (1978) Biochim. Biophys. Acta 35 into a plant host cell and expressed by means known to the 

5 17: 125-132 report that maize endosperm actively synthe- art. The insecticidal protein produced upon expression of the 

sizing zein, a storage protein rich in glut amine, leucine, and synthetic Bt gene in plants is functionally equivalent to a 

alanine, is characterized by higher levels of accepting activ- native Bt crystal protein in having toxicity to the same 

ity for these three amino acids than are maize embryo insects. 

tRNAs. This may indicate that the tRNA population of 40 OFSCRFPTinN ofthf Firing 

specific plant tissues may be adapted for optimum transla- BRIEF DESCR IP™N G F THE FIGURES 

tion of highly expressed proteins such as zein. To our FIGS. 1A and IB present the nucleotide sequence for the 

knowledge, no one has experimentally altered codon bias in synthetic Btt gene. Where different, the native sequence as 

highly expressed plant genes to determine possible effects of found in p544Pst-Met5 is shown above. Changes in amino 

the protein translation in plants to check the effects on the 45 acids (underlined) occur in the synthetic sequence with 

level of expression. alanine replacing threonine at residue 2 and leucine replac- 



SUMMARY OF THE INVENTION 



ing the stop at residue 596 followed by the addition of 
13-amino acids at the C-terminus. 



It is the overall object of the present invention to provide FIG. 2 represents a simplified scheme used in the con- 

a means for plant protection against insect damage. The 50 struction of the synthetic Btt gene. Segments A through M 

invention disclosed herein comprises a chemically synthe- represent oligonucleotide pieces annealed and ligated 

sized gene encoding an insecticidal protein which is func- together to form DNA duplexes having unique splice sites to 

tionally equivalent to a native insecticidal protein of Bt. This allow specific enzymatic assembly of the DNA segments to 

synthetic gene is designed to be expressed in plants at a level give the desired gene. 

higher than a native Bt gene It Ls preferred that the synthetic 55 FIG. 3 is a schematic diagram showing the assembly of 

gene be designed to be highly expressed in plants as defined oligonucleotide segments in the construction of a synthetic 

herein. Preferably the synthetic gene is at least approxi- Bn gene> ^ ch scgment (A through M) is 5uUl from oii 

mately 85% homologous to an insecticidal protein gene of nucleo tides of different sizes, annealed and hated to form the 

desired DNA segment. 

It is a particular object of this invention to provide a eo 

synthetic structural gene coding for an insecticidal protein BRIEF DESCRIPTION OF THE SEQUENCES 

from Btt having, for example, the nucleotide sequences SEQ ID NO. 1 is the native DNA gene sequence corre- 

presented in FIG. 1 and spanning nucleotides 1 through 1793 sponding to FIG. 1. 

or spanning nucleotide 1 through 1833 with functional SEQ ID NO. 2 is the corresponding amino acid sequence 

equivalence. 65 encoded by SEQ ID NO. 1. 

In designing synthetic Btt genes of this invention for SEQ ID NO. 3 is the synthetic DNA gene sequence corre- 

enhanced expression in plants, the DNA sequence of the sponding to FIG. 1. 
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SEQ ID NO. 4 is the corresponding amino acid sequence 

encoded by SEQ ID NO. 3. 
SEQ ID NO. 5 is the sequence of a synthetic DNA linker 

described in Example l(i). 
SEQ ID NO. 6 is the sequence of the 5' plant consensus 

splice site found in Example l(iiiXd). 
SEQ ID NO. 7 is the sequence of the 3' plant consensus 

splice site found in Example l(iiiXd). 
SEQ ID NO. 8 is the nucleotide sequence of Segment A 

found in Table 4. 
SEQ ID NO. 9 is the nucleotide sequence of Segment M 

found in Table 5. 

DETAILED DESCRIPTION OF THE 
INVENTION 

The following definitions are provided in order to provide 
clarity as to the intent or scope of their usage in the 
Specification and claims. 

Expression refers to the transcription and translation of a 
structural gene to yield the encoded protein. The synthetic Bt 
genes of the present invention are designed to be expressed 
at a higher level in plants than the corresponding native Bt 
genes. As will be appreciated by those skilled in the art, 
structural gene expression levels are affected by the regu- 
latory DNA sequences (promoter, polyadenylation sites, 
enhancers, etc.) employed and by the host cell in which the 
structural gene is expressed. Comparisons of synthetic Bt 
gene expression and native Bt gene expression must be made 
employing analogous regulatory sequences and in the same 
host cell. It will also be apparent that analogous means of 
assessing gene expression must be employed in such com- 
parisons. 

Promoter refers to the nucleotide sequences at the 5' end 
of a structural gene which direct the initiation of transcrip- 
tion. Promoter sequences are necessary, but not always 
sufficient, to drive the expression of. a downstream gene. In 
prokaryotes, the promoter drives transcription by providing 
binding sites to RNA polymerases and other initiation and 
activation factors. Usually promoters drive transcription 
preferentially in the downstream direction, although promo- 
tional activity can be demonstrated (at a reduced level of 
expression) when the gene is placed upstream of the pro- 
moter. The level of transcription is. regulated by promoter 
sequences. Thus, in the construction of heterologous 
promoter/structural gene combinations, the structural gene is 
placed under the regulatory control of a promoter such that 
the expression of the gene is controlled by promoter 
sequences. The promoter is positioned preferentially 
upstream to the structural gene and at a distance from the 
transcription start site that approximates the distance 
between the promoter and the gene it controls in its natural 
setting. As is known in the art, some variation in this 
distance can be tolerated without loss of promoter function. 

A gene refers to the entire DNA portion involved in the 
synthesis of a protein. A gene embodies the structural or 
coding portion which begins at the 5' end from the transla- 
lional start codon (usually ATG) and extends to the stop 
(TAG, TGA or TAA) codon at the 3' end. It also contains a 
promoter region, usually located 5' or upstream to the 
structural gene, which initiates and regulates the expression 
of a structural gene. Also included in a gene are the 3' end 
and poly(A)+ addition sequences. 

Structural gene is that portion of a gene comprising a 
DNA segment encoding a protein, polypeptide or a portion 
thereof, and excluding the 5' sequence which drives the 
initiation of transcription. The structural gene nay be one 
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which is normally found in the cell or one which is not 
normally found in the cellular location wherein it is 
introduced, in which case it is termed a heterologous gene. 
A heterologous gene may be derived in whole or in part from 
5 any source know to the art, including a bacterial genome or 
episome, eukaryotic, nuclear or plasraid DNA, cDNA, viral 
DNA or chemically synthesized DNA. A structural gene 
may contain one or more modifications in either the coding 
or the untranslated regions which could affect the biological 
no activity or the chemical structure of the expression product, 
the rate of expression or the manner of expression control. 
Such modifications include, but are not limited to, 
mutations, insertions, deletions and substitutions of one or 
more nucleotides. The structural gene may constitute an 
15 uninterrupted coding sequence or it may include one or more 
introns, bounded by the appropriate splice junctions. The 
structural gene may be a composite of segments derived 
from a plurality of sources, naturally occurring or synthetic. 
The structural gene may also encode a fusion protein. 
20 Synthetic gene refers to a DNA sequence of a structural 
gene that is chemically synthesized in its entirety or for the 
greater part of the coding region. As exemplified herein, 
oligonucleotide building blocks are synthesized using pro- 
cedures known to those skilled in the art and are ligated and 
25 annealed to form gene segments which are then enzymati- 
cally assembled to construct the entire gene. As is recog- 
nized by those skilled in the art, functionally and structurally 
equivalent genes to the synthetic genes described herein may 
be prepared by site-specific mutagenesis or other related 
30 methods used in the art. 

Transforming refers to stably introducing a DNA segment 
carrying a functional gene into an organism that did not 
previously contain that gene. 
J5 Plant tissue includes differentiated and undifferentiated 
tissues of plants, including but not limited to, roots, shoots, 
leaves, pollen, seeds, tumor tissue and various forms of cells 
in culture, such as single cells, protoplasts, embryos and 
callus tissue. The plant tissue may be in planta or in organ, 
An tissue or cell culture. 

Plant cell as used herein includes plant cells in planta and 
plant cells and protoplasts in culture. 

Homology refers to identity or near identity of nucleotide 
or amino acid sequences. As is understood in the art, 
45 nucleotide mismatches can occur at the third or wobble base 
in the codon without causing amino acid substitutions in the 
final polypeptide sequence. Also, minor nucleotide modifi- 
cations (e.g., substitutions, insertions or deletions) in certain 
regions of the gene sequence can be tolerated and considered 
50 insignificant whenever such modifications result in changes 
in amino acid sequence that do not alter functionality of the 
final product. It has been shown that chemically synthesized 
copies of whole, or parts of, gene sequences can replace the 
corresponding regions in the natural gene without loss of 
55 gene function. Homo logs of specific DNA sequences may be 
identified by those skilled in the art using the test of 
cross-hybridization of nucleic acids under conditions of 
stringency as Ls well understood in the art (as described in 
Hames and Higgens (eds.) (1985) Nucleic Acid 
60 Hybridization , IRL Press, Oxford, UK). Extent of homology 
is often measured in terms of percentage of identity between 
the sequences compared. 

Functionally equivalent refers to identity or near identity 
of function. A synthetic gene product which is toxic to at 
65 least one of the same insect species as a natural Bt protein 
is considered functionally equivalent thereto. As exemplified 
herein, both natural and synthetic Btt genes encode 65 kDa 
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insecticidal proteins having essentially identical amino acid 
sequences and having toxicity to coleopteran insects. The 
synthetic Bt genes of the present invention are not consid- 
ered to be functionally equivalent to native Bt genes, since 
they are expressible at a higher level in plants than native Bt 
genes. 

Frequency of preferred codon usage refers to the prefer- 
ence exhibited by a specific host cell in usage of nucleotide 
codons to specify a given amino acid. To determine the 
frequency of usage of a particular codon in a gene, the 
number of occurrences of that codon in the gene is divided 
by the total number of occurrences of all codons specifying 
the same amino acid in the gene. Table 1, for example, gives 
the frequency of codon usage for Bt genes, which was 
obtained by analysis of four Bt genes whose sequences are 
publicly available. Similarly, the frequency of preferred 
codon usage exhibited by a host cell can be calculated by 
averaging frequency of preferred codon usage in a large 
number of genes expressed by the host cell. It is preferable 
that this analysis be limited to genes that are highly 
expressed by the host cell. Table 1, for example, gives the 
frequency of codon usage by highly expressed genes exhib- 
ited by dicotyledonous plants, and raonocotyledonous 
plants. The dicot codon usage was calculated using 154 
highly expressed coding sequences obtained from Genbank 
which are listed in Table 1. Monocot codon usage was 
calculated using 53 monocot nuclear gene coding sequences 
obtained from Genbank and listed in Table 1, located in 
Example 1. 

When synthesizing a gene for improved expression in a 
host call it is desirable to design the gene such that its 
frequency of codon usage approaches the frequency of 
preferred codon usage of the host cell. 

The percent deviation of the frequency of preferred codon 
usage for a synthetic gene from that employed by a host cell 
is calculated first by determining the percent deviation of the 
frequency of usage of a single codon from that of the host 
cell followed by obtaining the average deviation over all 
codons. As defined herein this calculation includes unique 
codons (i.e., ATG and TGG). The frequency of preferred 
codon usage of the synthetic Btt gene, whose sequence is 
given in FIGS. 1A--C, is given in Table 1. The frequency of 
preferred usage of the codon *GTA for valine in the syn- 
thetic gene (0.10) deviates from that preferred by dicots 
(0.12) by 0.02/0.12=0.167 or 16.7%. The average deviation 
over all amino acid codons of the Btt synthetic gene codon 
usage from that of dicot plants is 7.8%. In general terms the 
overall average deviation of the codon usage of a synthetic 
gene from that of a host cell is calculated using the equation 

J — 1 x 100 



where X„«*frequency of usage for codon n in the host cell; 
Y, ^frequency of usage for codon n in the synthetic gene. 
Where n represents an individual codon that specifies an 
amino acid, the total number of codons is Z, which in the 
preferred embodiment is 61. The overall deviation of the 
frequency of codon usage, A, for all amino acids should 
preferably be less than about 25%, and more preferably less 
than about 10%. 

Derived from is used to mean taken, obtained, received, 
traced, replicated or descended from a source (chemical 65 
and/or biological). A derivative may be produced by chemi- 
cal or biological manipulation (including but not limited to 
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substitution, addition, insertion, deletion, extraction, 
isolation, mutation and replication) of the original source. 

Chemically synthesized, as related to a sequence of DNA, 
means that the component nucleotides were assembled in 
5 vitro. Manual chemical synthesis of DNA may be accom- 
plished using well established procedures (Caruthers, M. 
(1983) in Methodology of DNA andRNA Sequencing, Weiss- 
man (ed.), Praeger Publishers, New York, Chapter 1), or 
automated chemical synthesis can be performed using one of 
10 a number of commercially available machines. 

The term, designed to be highly expressed as used herein 
refers to a level of expression of a designed gene wherein the 
amount of its specific mRNA transcripts produced is suffi- 
cient to be quantified in Northern blots and, thus, represents 
15 a level of specific mRNA expressed corresponding to greater 
than or equal to approximately 0.001% of the poly(A)+ 
mRNA. To date, natural Bt genes are transcribed at a level 
wherein the amount of specific mRNA produced is insuffi- 
cient to be estimated using the Northern blot technique. 
20 However, in the present invention, transcription of a syn- 
thetic Bt gene designed to be highly expressed not only 
allows quantification of the specific mRNA transcripts pro- 
duced but also results in enhanced expression of the trans- 
lation product which is measured in insecticidal bio assays. 
25 Crystal protein or insecticidal crystal protein or crystal 
toxin refers to the major protein component of the parasporal 
crystals formed in strains of Bt. This protein component 
exhibits selective pathogenicity to different species of 
insects. The molecular size of the major protein isolated 
30 from parasporal crystals varies depending on the strain of Bt 
from which it is derived. Crystal proteins having molecular 
weights of approximately 132, 65, and 28 kDa have been 
reported. It has been shown that the approximately 132 kDa 
protein is a protoxin that is cleaved to form an approximately 
35 65 kDa toxin. 

The crystal protein gene refers to the DNA sequence 
encoding the insecticidal crystal protein in either full length 
protoxin or toxin form, depending on the strain of Bt from 
which the gene is derived. 
40 The authors of this invention observed that expression in 
plants of Bt crystal protein mRNA occurs at levels that are 
not routinely detectable in Northern blots and that low levels 
of Bt crystal protein expression correspond to this low level 
of mRNA expression. It is preferred for exploitation of these 
45 genes as potential biocontrol methods that the level of 
expression of Bt genes in plant cells be improved and that 
the stability of Bt mRNA in plants be optimized. This will 
allow greater levels of Bt mRNA to accumulate and will 
result in an increase in the amount of insecticidal protein in 
50 plant tissues. This is essential for the control of insects that 
are relatively resistant to Bt protein. 

Thus, this invention is based on the recognition that 
expression levels of desired, recombinant insecticidal pro- 
tein in transgenic plants can be improved via increased 
55 expression of stabilized mRNA transcripts; and that, 
conversely, detection of these stabilized RNA transcripts 
may be utilized to measure expression of translational 
product (protein). This invention provides a means of resolv- 
ing the problem -of low expression of insecticidal protein 
60 RNA in plants and, therefore, of low protein expression 
through the use of an improved, synthetic gene specifying an 
insecticidal crystal protein from Bt. 

Attempts to improve the levels of expression of Bt genes 
in plants have centered on comparative studies evaluating 
parameters such as gene type, gene length, choice of 
promoters, addition of plant viral untranslated RNA leader, 
addition of intron sequence and modification of nucleotides 
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surrounding the initiation ATG codon. To date, changes in for maintaining mRNAstahility in yeast. Without douht, the 

these parameters have not led to significant enhancement of degree of codon bias is an important factor to consider when 

Bt protein expression in plants. Applicants find that, engineering high expression of heterologous genes in yeast 

surprisingly, to express Bt proteins at the desired level in and other systems. 

plants, modifications in the coding region of the gene were 5 Experimental evidence obtained from point mutations and 

effective. Structural-function relationships can be studied deletion analysis has indicated that in eukaryotic genes 

using site-specific mutagenesis by replacement of restriction specific sequences are associated with post -transcriptional 

fragments with synthetic DNA duplexes containing the processing, RNA destabilization, translational termination, 

desired nucleotide changes (Lo et al. (1984) Proc. Natl. intron splicing and the like. These are preferably employed 
Acad. Sci. 81:2285-2289). However, recent advances in io in the synthetic genes of this invention. In designing a 

recombinant DNA technology now make it feasible to bacterial gene for expression in plants, sequences which 

chemically synthesize an entire gene designed specifically interfere with the efficacy of gene expression are eliminated, 

for a desired function. Thus, the Btt coding region was In designing a synthetic gene, modifications in nucleotide 

chemically synthesized, modified in such a way as to sequence of the coding region are made to modify the A+T 
improve its expression in plants. Also, gene synthesis pro- 15 content in DNA base composition of the synthetic gene to 

vides the opportunity to design the gene so as to facilitate its reflect that normally found in genes for highly expressed 

subsequent mutagenesis by incorporating a number of proteins native to the host cell. Preferably the A+T content 

appropriately positioned restriction endonuclease sites into of the synthetic gene is substantially equal to that of said 

the gene. genes for highly expressed proteins. In genes encoding 

The present invention provides a synthetic Bt gene for a 20 highly expressed plant proteins, the A+T content is approxi- 

crystal protein toxic to an insect. As exemplified herein, this mately 55%. It is preferred that the synthetic gene have an 

protein is toxic to coleopteran insects. To the end of improv- A+T content near this value, and not sufficiently high as to 

ing expression of this insecticidal protein in plants, this cause destabilization of RNA and, therefore, lower the 

invention provides a DNA segment homologous to a Btt protein expression levels. More preferably, the A+T content 
structural gene and, as exemplified herein, having approxi- 25 is no more than about 60% and most preferably is about 

mately 85% homology to the Btt structural gene in p544Pst- 55%. Also, for ultimate expression in plants, the synthetic 

Met5. In this embodiment the structural gene encoding a Btt gene nucleotide sequence is preferably modified to form a 

insecticidal protein is obtained through chemical synthesis plant initiation sequence at the 5' end of the coding region, 

of the coding region. A chemically synthesized gene is used In addition, particular attention is preferably given to assure 
in this embodiment because it best allows for easy and 30 that unique restriction sites are placed in strategic positions 

efficacious accommodation of modifications in nucleotide to allow efficient assembly of oligonucleotide segments 

sequences required to achieve improved levels of cross- during construction of the synthetic gene and to facilitate 

expression. subsequent nucleotide modification. As a result of these 

Today, in general, chemical synthesis is a preferred modifications in coding region of the native Bt gene, the 

method to obtain a desired modified gene. However, to date, 35 preferred synthetic gene is expressed in plants at an 

no plant protein gene has been chemically synthesized nor enhanced level when compared to that observed with natural 

has any synthetic gene for a bacterial protein been expressed Bt structural genes. 

in plants. In this invention, the approach adopted for syn- In specific embodiments, the synthetic Bt gene of this 

thesizing the gene consists of designing an improved nucle- invention encodes a Btt protein toxic to coleopteran insects, 

otide sequence for the coding region and assembling the 40 Preferably, the toxic polypeptide is about 598 amino acids in 

gene from chemically synthesized oligonucleotide seg- length, is at least 75% homologous to a Btt polypeptide, and, 

ments. In designing the gene, the coding region of the as exemplified herein, is essentially identical to the protein 

naturally-occurring gene, preferably from the Btt subclone, encoded by p544Pst-Met5, except for replacement of threo- 

p544Pst-Met5, encoding a 65 kDa polypeptide having nine by alanine at residue 2. This amino acid substitution 

coleoperan toxicity, is scanned for possible modifications 45 results as a consequence of the necessity to introduce a 

which would result in improved expression of the synthetic guanine base at position +4 in the coding sequence, 

gene in plants. For example, to optimize the efficiency of In designing the synthetic gene of this invention, the 

translation, codons preferred in highly expressed proteins of coding region from the Btt subclone, p544Pst-Met5, encod- 

the host cell are utilized. ing a 65 kDa polypeptide having coleopteran toxicity, is 

Bias in codon choice within genes in a single species 50 scanned for possible modifications which would result in 

appears related to the level of expression of the protein improved expression of the synthetic gene in plants. For 

encoded by that gene. Codon bias is most extreme in highly example, in preferred embodiments, the synthetic insecti- 

expressed proteins of E. coli and yeast. In these organisms, cidal protein is strongly expressed in dicot plants, e.g., 

a strong positive correlation has been reported between the tobacco, tomato, cotton, etc., and hence, a synthetic gene 

abundance of an isoaccepting tRNA species and the favored 55 under these conditions is designed to incorporate to advan- 

synonymous codon. In one group of highly expressed pro- tage codons used preferentially by highly expressed dicot 

teins in yeast, over 96% of the amino acids are encoded by proteins. In embodiments where enhanced expression of 

only 25 of the 61 available codons (Bennetzen and Hall insecticidal protein is desired in a monocot, codons pre- 

(1982) J. Biol. Chem. 257:3026-3031). These 25 codons are ferred by highly expressed monocot proteins (given in Table 

preferred in all sequenced yeast genes, but the degree of 60 1) are employed in designing the synthetic gene, 

preference varies with the level of expression of the genes. In general, genes within a taxonomic group exhibit simi- 

Recently, Hoekema and colleagues (1987) Mol. Cell. Biol. larities in codon choice, regardless of the function of these 

7:2914-2924 reported that replacement of these 25 preferred genes. Thus an estimate of the overall use of the genetic code 

codons by minor codons in the 5' end of the highly expressed by a taxonomic group can be obtained by summing codon 

yeast gene PGK1 results in a decreased level of both protein 65 frequencies of all its sequenced genes. This species-specific 

and mRNA. They concluded that biased codon choice in codon choice is reported in this invention from analysis of 

highly expressed genes enhances translation and is required 208 plant genes. Both monocot and dicot plants are analyzed 
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individually to determine whether these broader taxonomic 
groups are characterized by different patterns of synony- 
mous codon preference. The 208 plant genes included in the 
codon analysis code for proteins having a wide range of 
functions and they represent 6 monocot and 36 dicot species. 
These proteins are present in different plant tissues at 
varying levels of expression. 

In this invention it is shown that the relative .use of 
synonymous codons differs between the monocots and the 
dicots. In general, the most important factor in discriminat- 
ing between monocot and dicot patterns of codon usage is 
the percentage G+C content of the degenerate third base. In 
monocots, 16 of 18 amino acids favor G+C in this position, 
while dicots only favor G+C in 7 of 18 amino acids. 

The G ending codons for Thr, Pro, Ala and Ser are 
avoided in both inonocots and dicots because they contain C 
in codon position II. The CG dinucleotide is strongly 
avoided in plants (Boudraa (L987) Genet. Sel. Evol. 
19:143-154) and other eukaryotes (Grantham et al. (1985) 
Bull. Inst. Pasteur 83:95-148), possibly due to regulation 
involving methylation. In dicots, XCG is always the least 
favored codon, while in monocots this is not the case. The 
doublet TA is also avoided in codon positions [I and III in 
most eukaryotes, and this is true of both nonocots and dicots. 

Grantham and colleagues (1986) Oxford Surveys in Evol. 
Biol. 3:48-61 have developed two codon choice indices to 
quantify CG and TA doublet avoidance in codon positions II 
and III. XCG/XCC is the ratio of codons having C as base 
II of G-ending to C-ending triplets, while XTA/XTT is the 
ratio of A-ending to Tending triplets with T as the second 
base. These indices have been calculated for the plant data 
in this paper (Table 2) and support the conclusion that 
monocot and dicot species differ in their use of these 
dinucleotides. 

TABLE 2 



Avoidance of CG and TA doublets in codons position II— III. 
XCG/XCC and XTA/XAA values are multiplied by 100. 







Di- 


Mono- 




Soy- 


RuBPC 




Group 


Plants 


cots 


cots 


Maize 


bean 


SSU 


CAB 


XCG/XCC 


40 


30 


61 


67 


37 


18 




XTA/XTT 


37 


35 


47 


43 


41 


9 


13 



RuBPC SSU = ribulose 1,5 bisphosphate small subunit 
CAB - chlorophyll a/b binding protein 

Additionally, for two species, soybean and maize, 
species-specific codon usage profiles were calculated (not 
shown). The maize codon usage pattern resembles that of 
monocots in general, since these sequences represent over 
half of the monocot sequences available. The codon profile 
of the maize subsample is even more strikingly biased in its 
preference for G+C in codon position III. On the other hand, 
the soybean codon usage pattern is almost identical to the 
general dicot pattern, even though it represents a much 
smaller portion of the entire dicot sample. 

In order to determine whether the coding strategy of 
highly expressed genes such as the ribulose 1,5 bisphosphate 
small subunit (RuBPC SSU) and chlorophyll a/b binding 
protein (CAB) is more biased than that of plant genes in 
general, codon usage profiles for subsets of these genes (19 
and 17 sequences, respectively) were calculated (not 
shown). The RuBPC SSU and CAB pooled samples are 
characterized by stronger avoidance of the codons XCG and 
XTA than in the larger monocot and dicot samples (Table 2). 
Although most of the genes in these subsamples are dicot in 
origin (17/19 and 15/17), their codon profile resembles that 
of the monocots in that G+C is utilized in the degenerate 
base III. 



,891 

12 

The use of pooled data for highly expressed genes may 
obscure identification of species-specific patterns in codon 
choice. Therefore, the codon choices of individual genes for 
RuBPC SSU and CAB were tabulated. The preferred codons 
5 of the maize and wheat genes for RuBPC SSU and CAB are 
more restricted in general than are those of the dicot species. 
This is in agreement with Matsuoka et al. (1987) J. Biochem. 
102:673-676) who noted the extreme codon bias of the 
maize RuBPC SSU gene as well as two other highly 
3Q expressed genes in maize leaves, CAB and phospho- 
enolpyruvate carboxylase. These genes almost completely 
avoid the use of A+T in codon position III, although this 
codon bias was not as pronounced in non-leaf proteins such 
as alcohol dehydrogenase, zein 22 kDa sub-unit, sucrose 
synthetase and ATP/ADP translocator. Since the wheat SSU 
15 and CAB genes have a similar pattern of codon preference, 
this may reflect a common monocot pattern for these highly 
expressed genes in leaves. The CAB gene for Lemna and the 
RUBPC SSU genes for Chlamdomonas share a similar 
extreme preference for G+C in codon position III. In dicot 
20 CAB genes, however, A+T degenerate bases are preferred 
by some synonymous codons (e.g., GCT for Ala, CTT for 
Leu, GGA and GGT for Gly). In general, the G+C prefer- 
ence is less pronounced for both RuBPC SSU and CAB 
genes in dicots than in monocots. 
25 In designing a synthetic gene for expression in plants, 
attempts are also made to eliminate sequences which inter- 
fere with the efficacy of gene expression. Sequences such as 
the plant polyadenylation signals, e.g., AATAAA, poly- 
merase D termination sequence, e.g., CAN (7 . 9) AGTNNAA, 
30 UCUUCGG hairpins and plant consensus splice sites are 
highlighted and, if present in the native Btt coding sequence, 
are modified so as to eliminate potentially deleterious 
sequences. 

Modifications in nucleotide sequence of the Btt coding 
35 region are also preferably made to reduce the A+T content 
in DNAbase composition. The Btt coding region has an A+T 
content of 64%, which is about 10% higher than that found 
in a typical plant coding region. Since A+T-rich regions 
typify plant intergenic regions and plant regulatory regions, 
40 it is deemed prudent to reduce the A+T content. The syn- 
thetic Btt gene is designed to have an A+T content of 55%, 
in keeping with values usually found in plants. 

Also, a single modification (to introduce guanine in lieu 
of adenine) at the fourth nucleotide position in the Btt coding 
45 sequence is made in the preferred embodiment to form a 
sequence consonant with that believed to function as a plant 
initiation sequence (Taylor et al. (1987) Mol. Gen. Genet. 
210:572-577) in optimization of expression. In addition, in 
exemplifying this invention thirty-nine nucleotides (thirteen 
50 codons) are added to the coding region of the synthetic gene 
in an attempt to stabilize primary transcripts. However, it 
appears that equally stable transcripts are obtained in the 
absence of this extension polypeptide containing thirty-nine 
nucleotides. 

55 Not all of the above-mentioned modifications of the 
natural Bt gene must be made in constructing a synthetic Bt 
gene in order to obtain enhanced expression. For example, 
a synthetic gene may be synthesized for other purposes in 
addition to that of achieving enhanced levels of expression. 

60 Under these conditions, the original sequence of the natural 
Bt gene may be preserved within a region of DNA corre- 
sponding to one or more, but not all, segments used to 
construct the synthetic gene. Depending on the desired 
purpose of the gene, modification may encompass substitu- 

65 tion of one or more, but not all, of the oligonucleotide 
segments used to construct the synthetic gene by a corre- 
sponding region of natural Bt sequence. 
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As is known to those skilled in the art of synthesizing (1985) Proc. Natl. Acad. Sci. USA 82:5824), microinjection 

genes (Mandecki et al. (1985) Proc. Natl. Acad. Sci. (Crossway, A. et al. (1986) Mol. Gen. Genet. 202:179), or 

82:3543-3547; Feretti et al, (1986) Proc. Natl. Acad. Sci. T-DNA mediated transfer from Agrobacterium twnefaciens 

83:599-603), the DNA sequence to be synthesized is to the plant tissue. There appears lo be no fundamental 

divided into segment lengths which can be synthesized 5 limitation of T-DNA transformation to the natural host range 

conveniently and without undue complication. As exempli- of Agrobacterium. Successful T-DNA-mediated transforma- 

fied herein, in preparing to synthesize the Btt gene, the tion of monocots (Hooykaas-Van Slogteren, G. et al. (1984) 

coding region is divided into thirteen segments (A-M). Each Nature 311:763 X gymnosperm (Dandekar, A et al. (1987) 

segment has unique restriction sequences at the cohesive Biotechnology 5:587) and algae (Ausich, R., EPO applica- 

ends. Segment A, for example, is 228 base pairs in length 10 tl0n 108 > 580 ) has becn reported. Representative T-DNA 

and is constructed from six oligonucleotide sections, each no^ 

containing approximately 75 bases ^ (^3^ 

In* Si 7* Tof rh f H l ° EMBO J. 2:987; Herrer'a-Estrella, L. et al. (1985) in Plant 

ments The length of the protruding cohesive ends in Genetic Engineering ^ New York: Cambri V dge u nive rsity 

complementary ohgonucleoude segments is four to five is PresS) p . 63 . Qnce introduced into the plant tissue, the 

residues. In the strategy evolved for gene synthesis, the sites expression of the structural gene may be assayed by any 

designed for the joining of oligonucleoude pieces and DNA mea ns known to the art, and expression may be measured as 

segments are different from the restriction sites created in the mRNA transcribed or as protein synthesized. Techniques are 

gene, known for the in vitro culture of plant tissue, and in a 

In the specific embodiment, each DNA segment is cloned 20 number of cases, for regeneration into whole plants. Proce- 

into a pIC-20 vector for amplification of the DNA The dures for transferring the introduced expression complex to 

nucleotide sequence of each fragment is determined at this commercially useful cultivars are known to those skilled in 

stage by the dideoxy method using the recombinant phage the art. 

DNA as templates and selected synthetic oligonucleotides as In one of its preferred embodiments the invention dis- 

primers. 25 closed herein comprises expression in plant cells of a 

As exemplified herein and illustrated schematically in synthetic insect ic id al structural gene under control of a plant 

FIGS. 3 and 4, each segment individually (e.g., segment M) expressible promoter, that is to say, by inserting the insec- 

is excised at the flanking restriction sites from its cloning ticide structural gene into T-DNA under control of a plant 

vector and spliced into the vector containing segment A. expressible promoter and introducing the T-DNA containing 

Most often, segments are added as a paired segment instead 30 the insert into a plant cell using known means. Once plant 

of as a single segment to increase efficiency. Thus, the entire cells expressing a synthetic insecticidal structural gene 

gene is constructed in the original plasmid harboring seg- under control of a plant expressible promoter are obtained, 

ment A. The nucleotide sequence of the entire gene is plant tissues and whole plants can be regenerated therefrom 

determined and found to correspond exactly to that shown in using methods and techniques well-known in the art. The 

FIGS. 1A-C. 35 regenerated plants are then reproduced by conventional 

In preferred embodiments the synthetic Btt gene is means and the introduced genes can be transferred to other 

expressed in plants at an enhanced level when compared to strains and cultivars by conventional plant breeding tech- 

that observed with natural Btt structural genes. To that end, niques. 

the synthetic structural gene is combined with a promoter The introduction and expression of the synthetic structural 

functional in plants, the structural gene and the promoter 40 gene for an insecticidal protein can be used to protect a crop 

region being in such position and orientation with respect to from infestation with common insect pests. Other uses of the 

each other that the structural gene can be expressed in a cell invention, exploiting the properties of other insecticide 

in which the promoter region is active, thereby forming a structural genes introduced into other plant species will be 

functional gene. The promoter regions include, but are not readily apparent to those skilled in the art. The invention in 

limited to, bacterial and plant promoter regions. To express 45 principle applies to. introduction of any synthetic insecticide 

the promoter region/structural gene combination, the DNA structural gene into any plant species into which foreign 

segment carrying the combination is contained by a cell. DNA (in the preferred embodiment T-DNA) can be intro- 

Combinations which include plant promoter regions are duced and in which said DNA can remain stably replicated, 

contained by plant cells, which, in turn, may be contained by In general, these taxa presently include, but are not limited 

plants or seeds. Combinations which include bacterial pro- 50 to, gymnosperms and dicotyledonous plants, such as sun- 

moter regions are contained by bacteria, e.g., Bt or £. colL flower (family Compositeae), tobacco (family Solanaceae), 

Those in the art will recognize that expression in types of alfalfa, soybeans and other legumes (family Leguminoseae), 

micro-organisms other than bacteria may in some circum- cotton (family Malvaceae), and most vegetables, as well as 

stances be desirable and, given the present disclosure, fea- monocotyledonous plants. A plant containing in its tissues 

sible without undue experimentation. 55 increased levels of insecticidal protein will control less 

The recombinant DNA molecule carrying a synthetic sasceptible types of insect, thus providing advantage over 

structural gene under promoter control can be introduced present insecticidal uses of Bt. By incorporation of the 

into plant tissue by any means known to those skilled in the insecticidal protein into the tissues of a plant, the present 

art. The technique used for a given plant species or specific invention additionally provides advantage over present uses 

type of plant tissue depends on the known successful tech- 60 of insecticides by eliminating instances of nonuniform appli- 

niques. As novel means are developed for the stable inser- cation and the costs of buying and applying insecticidal 

tion of foreign genes into plant cells and for manipulating preparations to a field. Also, the present invention eliminates 

the modified cells, skilled artisans will be able to select from the need for careful timing of application of such prepara- 

known means to achieve a desired result. Means for intro- lions since small larvae are most sensitive to insecticidal 

ducing recombinant DNA into plant tissue include, but are 65 protein and the protein is always present, minimizing crop 

not limited to, direct DNA uptake (Paszkowski, J. et al. damage that would otherwise result from preapplication 

(1984) EMBO J. 3:2717), electroporation (Fromm, M. et al. larval foraging. 
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This invention combines the specific teachings of the 
present disclosure with a variety of techniques and expedi- 
ents known in the art. The choice of expedients depends on 
variables such as the choice of insecticidal protein from a Bi 
strain, the extent of modification in preferred codon usage, 
manipulation of sequences considered to be destabilizing to 
RNA or sequences prematurely terminating transcription, 
insertions of restriction sites within the design of the syn- 
thetic gene to allow future nucleotide modifications, addi- 
tion of introns or enhancer sequences to the 5' and/or 3 1 ends 
of the synthetic structural gene, the promoter region, the host 
in which a promoter region/structural gene combination is 
expressed, and the like. As novel insecticidal proteins and 
toxic polypeptides are discovered, and as sequences respon- 
sible for enhanced cross-expression (expression of a foreign 
structural gene in a given host) are elucidated, those of 
ordinary skill will be able to select among those elements to 
produce "improved" synthetic genes for desired-proteins 
having agronomic value. The fundamental aspect of the 
present invention is the ability to synthesize a novel gene 
coding for an insecticidal protein, designed so that the 
protein will be expressed at an enhanced level in plants, yet 
so that it will retain its inherent property of insect toxicity 
and retain or increase its specific insecticidal activity. 

EXAMPLES 

The following Examples are presented as illustrations, of 
embodiments of the present invention. They do not limit the 
scope of this invention, which is determined by the claims. 

The following strains were deposited with the Patent 
Culture Collection, Northern Regional Research Center, 
1815 N. University Street, Peoria, 111. 61604. 



Strain Deposited on Accession # 

E. coli MC1061 (p544-HindIir) 6 October 1987 NRRL B-18257 
E. coli MC1061 (p544Pst-Met5) 6 October 1987 NRRL B-18258 
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7, is digested with BglH and religated, thereby removing 
Bacillus sequences flanking the 3'-end of the crystal protein 
gene. The resulting plasmid, p405/54-12, is digested with 
PstI and religated, thereby removing Bacillus sequences 
5 flanking the 5'-end of the crystal protein and about 150 bp 
from the 5'-end of the crystal protein structural gene. The 
resulting plasmid, p405/81-4, is digested with SphI and PstI 
and is mixed with and ligated to a synthetic linker having the 
following structure: 

10 

SD MetThrAla (SEQ ID NO:5) 

5 ' CAGGATCCAACAATGACTGCA3 ' 
3 ' GTACGTCCTAGGTTGTTACTG5 ' 
Sph I Pst I 

15 (SD indicates the location of a Shine-Dalgarno prokaryotic 
ribosome binding site.) The resulting plasmid, p544Pst- 
Met5, contains a structural gene encoding a protein identical 
to one encoded by pNSBP544 except for a deletion of the 
amino-terminal 47 amino acid residues. The nucleotide 

20 sequence of the Btt coding region in p544Pst-Met5 is 
presented in FIG. 1. In bioassays (Sekar and Adang, U.S. 
patent application Ser. No. 108,285, supra), the proteins 
encoded by the full-length Btt gene in pNSBP544 and the 
N-terminal deletion derivative, p544Pst-Met5, were shown 

25 to be equally toxic. All of the plasmids mentioned above 
have their crystal protein genes in the same orientation as the 
lacZ gene of the vector, 
(ii) Modification of Preferred Codon Usage 

Table 1 presents the frequency of codon usage for (A) 

30 dicot proteins, (B) Bt proteins, (C) the synthetic Btt gene, 
and (D) monocot proteins. Although some codons for a 
particular amino acid are utilized to approximately the same 
extent by both dicot and Bt proteins (e.g., the codons for 
serine), for the most part, the distribution of codon fre- 

35 quency varies significantly between dicot and Bt proteins, as 
illustrated in columns A and B in Table 1. 

TABLE 1 



The deposited strains are provided for the convenience of 
those in the art, and are not necessary to practice the present 
invention, which may be practiced with the present disclo- 
sure in combination with publicly available protocols, 
information, and materials. E. coli MC1061, a good host for 
plasmid transformations, was disclosed by Casadaban, M.J. 
and Cohen, S. N. (1980) J. Mol. Biol. 138:179-207. 

Example 1 

Design of the Synthetic Insecticidal Crystal Protein 
Gene 

(i) Preparation of Toxic Subclones of the Btt Gene 

Construction, isolation, and characterization of pNSB544 
is disclosed by Sekar, V. et al. (1987) Proc. Natl. Acad. Sci. 
USA 84:7036-7040, and Sekar, V. and Adang, M. J., U.S. 
patent application Ser. No. 108,285, filed Oct. 13, 1987, 
which is hereby incorporated by reference. A 3.0 kbp 
Hindlll fragment carrying the crystal protein gene of 
pNSBP544 is inserted into the Hindlll site of pIC-20H 
(Marsh, J. L. et al. (1984) Gene 32:481-485), thereby 
yielding a plasmid designated p544-HindIII, which is on 
deposit. Expression in E. coli yields a 73 kDa crystal protein 
in addition to the 65 kDa species characteristic of the crystal 
protein obtained from Btt isolates. 

A 5.9 kbp Bam HI fragment carrying the crystal protein 
gene is removed from pNSBP544 and inserted into BamHl- 
linearized pIC-20H DNA. The resulting plasmid, p405/44- 



Fiequency of Codon Usage 



Distribution Fraction 



Amino 




(A)Dicot 


(B)Bt 


(C)Synthelic 


(D)Monoc 


Acid 


Codon 


Genes 


Genes 


Btt Gene 


Genes 


45 Gly 


GGG 


0.12 


0.08 


0.13 


0.21 


Gly 


GGA 


0.38 


0.53 


0.37 


0.17 


Glv 


GGT 


0.33 


0.24 


0.34 


0.18 


Gly 


GGC 


0.16 


0.16 


0.16 


0.43 


Glu 


GAG 


0.51 


0.13 


0.52 


0.75 


Glu 


GAA 


0.49 


0.87 


0.48 


0.25 


50 Asp 


GAT 


0.58 


0.68 


0.56 


0.27 


Asp 


GAC 


0.42 


0.32 


0.44 


0.73 


Val 


GTG 


0.29 


0.15 


0.30 


0.36 


Val 


GTA 


0.12 


0.32 


0.10 


0.08 


Val 


GTT 


0.39 


0.29 


0.35 


0.19 


Val 


GTC 


0.20 


0.24 


0.25 


0.37 


55 AIa 


GCG 


0.06 


0.12 


0.06 


0.22 


Ala 


GCA 


0.25 


0.50 


0.24 


0.16 


Ala 


GCT 


0.42 


0.32 


0.41 


0.24 


Ala 


GCC 


0.27 


0.06 


0.29 


0.38 


Lys 


AACi 


0.61 


0.13 


0.58 


0.86 


Lvs 


AAA 


. 0.39 


0.87 


0.42 


0.14 


60 

Asn 


AAT 


0.45 


0.79 


0.44 


0.25 


AAC 


0.55 


0.21 


0.56 


D.75 


Met 


ATG 


1.00 


1.00 


1.00 


1.00 


He 


ATA 


0.18 


0.30 


0.20 


0.11 


He 


ATT 


0.45 


0.57 


0.43 


0.24 


He 


ATC 


0.37 


0.13 


0.37 


0.64 


Thr 


ACG 


0.08 


0.14 


0.07 


0.20 


65 Thr 


ACA 


0.27 


0.68 


0.27 


0.14 


Thr 


ACT 


0.35 


0.14 


0.34 


0.19 
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TABLE 1-cominued 



Frequency of Codon Usage 



Thr 


AOC 


0.30 


0.05 


0.32 


0.46 


Trp 


TOG 


1.00 


1.00 


1.00 


1.00 


End 


TGA 


0.33 


0.00 


O.OO 


0.34 


Cys 


TOT 


0.44 


0.33 


0.33 


0.30 


Cys 


TGC 


0.56 


0.67 


0.67 


0.70 


End 


TAG 


0.39 


0.00 


O.OO 


0.36 


End 


TAA 


0.48 


1.00 


1.00 


0.30 


TjT 


TAT 


0.43 


0.81 


0.43 


0.21 


Tyr 


TAC 


0.57 


0.19 


0.57 


0.79 


Phe 


TTT 


0.45 


0.75 


0.44 


0.25 


Phe 


TTC 


0.55 


0.25 


0.56 


0.75 


Scr 


AGT 


0.14 


0.25 


0.13 


0.08 


Ser 


AGC 


0.18 


0.13 


0.19 


0.26 


Ser 


TCG 


0.06 


0.08 


0.06 


0.14 


Ser 


TCA 


0.19 


0.19 


0.17 


0.11 


Ser 


tct 


0.25 


0.25 


0.27 


0.15 


Ser 


TCC 


0.18 


0.10 


0.17 


0.25 


Arg 


AGG 


0.25 


0.09 


0.23 


0.26 


Arg 


AGA 


0.30 


0.50 


0.32 


0.09 


Arg 


CGG 


0.04 


0.14 


0.05 


0.13 


Arg 


CGA 


0.08 


0.14 


0.09 


0.04 


Arg 


CGT 


0.21 


0.09 


0.23 


0.12 


Arg 


CGC 


0.11 


0.05 


0.09 


0.36 


Gin 


CAG 


0.41 


0.18 


0.39 


0.46 


Glo 


CAA 


0.59 


0.82 


0.61 


0.54 


His 


CAT 


0.54 


0.90 


0.50 


0.33 


His 


CAC 


0.46 


0.10 


0.50 


0.67 


Leu 


TTG 


0.26 


0.08 


0.27 


0.14 


Leu 


tta 


0.10 


0.46 


0.12 


0.03 


Leu 


ctg 


0.09 


0.04 


0.10 


0.28 


Leu 


cta 


o.os 


0.21 


0.10 


0.10 


Leu 


CTT 


0.28 


0.15 


0.18 


0.15 


Leu 


CTC 


0.19 


0.06 


0.22 


0.31 


Pro 


CCG 


0.09 


0.20 


0.08 


. 0.23 


Pro 


CCA 


0.42 


0.56 


0.44 


0.34 


Pro 


CCT 


0.32 


0.24 


0.32 


0.17 


Pro 


CCC 


0.17 


0.00 


0.16 


0.26 



Frequency of Codon Usage 



154 coding sequences of dicot nuclear genes were used to 
compile the codon usage table. The pooled dicot coding 
sequences, obtained from Genoa nk (release 55) oi. when no 
Genbank file name is specified, directly from the published 
source, were: 



GENUS/SPECIES 



GENBANK 



PROTEIN 



Antirrhinum majus 
Arabidopsis thaliana 



AMACHS 

ATHADH 

ATHH3GA 

ATHH3GB 

ATHH4GA 

ATHLHCP1 

ATHTUBA 



Bertitolletia excelsa 

Brvssica campcstris 
Brassica napus 
Brassica otcacea 

Canavalia cnsiformis 
Carica papaya 
Cfdamdomonas 
reinbardtii 



Cucurbita pepo 
Cucwnis sativus 



BNANAP 
BOLSLSGR 

CENCONA 

CPAPAP 

CREC552 

CR0RBCS1 

CRERBCS2 

CUCPHT 
CUSGMS 

CUSLHCPA 
CUSSSU 



Chalcone synthetase 
Alcohol dehydrogenase 
Hist one 3 gene 1 
His tone 3 gene 2 
Histone 4 gene 1 
CAB 
a tubulin 

5-enolpyruvyl4hifate 

3-phosphate 

synthetase 

High methionine storage 
protein 

Acyl carrier protein 
Napin 

S-locus specific 
glycoprotein 
Conca naval in A 
Papain 

Preapocytochrome 

RuBPC small subunit 
gene 1 

RuBPC small subunit 

gene 2 

Phytrichrome 

Glyoxosomal ma late 

synthetase 

CAB 

RuBPC small subunit 



5 Daucus carol a 



Dolidtos biflorus 
Flaveria trinenia 
Glycine max 



15 



20 



25 



30 



Gossypium liirsutwn 
Helianthtis annus 



40 



45 



Ipomoea batatas 
Lemna gibba 

Lupinus luseus 

Lycopersicon 

esculetUum 



50 



55 



60 



Medicago saliva 
65 Mcscmbryanthcmum 
crystaliinum 



DAREXT Extensin 
DAREXTR 33 kD extensin related 
protein 

DBILECS seed lectin 
FTRBCR RuBPC small subunit 

SOY7SAA 75 storage protein 
SOYACT1G Actin 1 
SOYCOPI CIl protease inhibitor 
SOYGLYA1A Glycinin Ala Bx 
subunits 

SOYGLYAAB Glycinin A5A4B3 
subunits 

SOYGLYAB Glycinin A3/b4 subunits 
SOYGLYR Glycinin A2Bla subunits 
SOYHSP175 Low M W heat shock 
proteins 

SOYLGBI Leghemoglobin 
SOYLEA Lectin 
SOYOX Lipoxygenase 1 

SOYNOD20G 20 kDa nodulin 
SOYNOD23G 23 kDa nodulin 
SOYNOD24H 24 kDa nodulin 
SOYNOD26B 26 kDa nodulin 
SOYNOD26R 26 kDa nodulin 
SOYNOD27R 27 kDa nodulin 
SOYNOD35M 35 kDa nodulin 
SOYNOD75 75 kDa nodulin 
SOYNODR1 Nodulin C51 
SOYNODR2 Nodulin E27 
SOYPRP1 Proline rich protein 
SOYRUBP RuBPC small subunit 
SOYURA Urease 
SOYHSP26A Heat shock protein 26A 

Nuclear-encoded 4 
chloroplast 
heat shock protein 
22 kDa nodulin 5 
pi tubulin 6 
02 tubulin 6 
Seed a globulin (viciiin) 7 
Seed B globulin (viciiin) 7 
RuBPC small subunit 
2S albumin seed 8 
storage protein 

Wound -induced catalase 9 
CAB 

RuBPC small subunit 
leghemoglobin I 
Biotin binding protein 

Ethylene biosynthesis 
protein 

Polygalacturonase-^ 
Tomato photosystem I 
protein 

RuBPC small subunit 
RuBPC small subunil 
RuBPC small subunit 
RuBPC small subunit 
Ripening related protein 
Wound induced 
proteinase 
inhibitor I 
Wound induced 
proteinase 
inhibitor II 

CAB 1A 10 
CAB IB 10 
CAB 3C 10 
CAB 4 11 
CAB 5 11 
ALFLB3R Leghemoglobin III 

RuBPC small subunit 12 



HNNRUBCS 



LGIAB19 
LGIR5BPC 
LUPLBR 
TOMBIOBR 

TOMETHYBR 

TOMPG2AR 
TOMPSI 

TOMRBCSA 

TOMRBCSB 

TOMRBCSC 

TOMRBCSD 

TOMRRD 

TOMWIPIG 



TOMWIPII 



19 
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TABLE 1 -continued 



TABLE I-continued 



Frequency of Codon Usage 



Nicoriana 
plumbaginifotia 



Nicotiana tabacum 



Perseus americana 
Petroselinum 
hortatse 
Petunia sp. 



Pliascolus vulgaris 



Pisum sativum 



Raplianus sativus 
Ricitms communis 



SUaie pratettsis 



Frequency of Codon Usage 



TOBATP21 


Mitochondrial ATP 












R cilhtim't 






Nitrate reductase 


13 




Glutamine synthetase 


14 


TOBECH 


f^n/irvr*h it i nncp 




TOBGAPA 


A ctlKlinit ftf olilnrnnloct 

» » auduiui ui cniuiopiasi 






G3PD 




TOBGAPB 


B subunit of chloroplast 






G3PD 




TOBGAPC 


C subunit of chloroplast 






G3PD 




TOBPR1 AR 


Pathogenesis re Idled 






protein la 




TDRPRICR 


Pa thog e nesis -re la ted 






protein lc 




TOBPRPR 


Pathogenesis related 






protein lb 




TORPXDI F 


Pe roxidase 




TOBRBPCO 


RuBPC small enhtmit 




TOBTHAUR 








homologous 






tf\ that into tin 




AVOCEL 








Ch&lcone synthase 




PETCAB13 


CAB 13 




PETCAB22L 


CAB 22L 




PETCAB22R 


CAB22R 




PETCAB25 


CAB 25 




PETCAB37 


CAB 37 




PETCAB91R 


CAB91R 




PETCHSR 


Chaloone synthase 




PETGCR1 


Glycine-rich protein 




PETRBCS08 


RuBPC small subunit 




PETRBCS11 


RuBPC small subunit 






70 kDa heat shock 


15 




protein 




rnVCHM 


Chitinase 




rHVDLcCA 


Phytohemogglutinin E 




PHVni FPU 


Phyto hemagglutinin L 




PHVGSR1 


Glutaoiine synthetase 1 




PHVGSR2 


frllltflmin^ c \rr\ t Facf 1 




PHV1 BA 


Leghemogobin 




PHVLECT 






PHVPAL 


Phenylalanine ammonia 






lyase 




PHVPHASAR 


<x phased in 




PHVPHASBR 


ft phaseolio 






Arcelin seed protein 


16 




Chaloone synthase 


17 


PEAALB2 


Seed albumin 




PEACABSO 


CAB 




PEAGSR1 


Glutamine synthetase 






(nodule) 




PEALECA 


Lectin 




PEALEGA 


Le gitalin 




PEARUBPS 


RuBPC small subunit 




PEAVIC2 


Vicilin 




PEAVIC4 


Vic ilia 




PEAVIC7 


Vicilin 






Alcohol dehydrogenase 1 


18 




Glutamine synthetase 


19 




(leaf) 






Glutamine synthetase 


19 




(root) 






His tone 1 


20 




Nuclear encoded 


4 




chloroplast 






heat shock protein 






RuBPC small subunit 


21 


RCCAGG 


Agglutinin 




RCCRICIN 


Ricin 




RCCICL4 


Isocitratc lyase 




SIPFDX 


Fer redox in precursor 





5 Solatium tuberosum 



10 



]5 Spinacia oleracea 



Vicia faba 



25 



PfYTPAT 


Pa tat in 


POTINH-nX/l 

rvJiif\riwi 


Wound-induced 




proteinase 




inhibitor 


r\Jl LoJu 


Light-inducible tissue 




specific 




ST- LSI gene 


rLJll2G 


Wound -induced 




proteinase 




inhibitor II 


POTRBCS 


RuBPC small subunit 




Sucrose synthetase 


SPIACPI 


Acyl carrier protein 1 


SPIOEC16 


16 kDa photosynthetic 




oxygen-evolving protein 


SPIOEC23 


23 kDa photosynthetic 




oxygen-evolving protein 


SPIPCG 


Plastocyanin 


SPIPS33 


33 kDa photosynthetic 




water 




oxidation complex 




precursor 




Glycolate oxidase 


VFALBA 


Leg hemoglobin 


VFALEB4 


Legumin B 




Vicillin 



Pooled 53 monocot coding sequences obtained from Genbank 
(release 55) or, when no Genbank file name is specified, 
directly from the published source, were: 



23 



24 



30 


GENUS/SPECIES 


GENBANK 


PROTEIN 


REF 




Avena saliva 


ASTAP3R 


Phytochrome 3 






Hordeum vulgare 


BLYALR 


Aleuraio 








BLYAMY1 


a amylase 1 








BLYAMY2 


a amylase 2 




35 




BLYCHORD1 


Hordein C 








BLYGLUCB 


ft glucanase 1 








BLYHORB 


Bl hordein 








BLYPAPI 


Am ylase/pro tease 










inhibitor 








BLYTH1AR 


Toxin a hordothionin 




40 




BLYUBIQR 


Ubiquitin 








Histone 3 


25 








Leaf specific thionin 1 


26 








Leaf specific thionin 2 


26 




Oryta sativa 




Plastocyanin 


27 




RICGLUTG 


Glutei in 




45 






Glutei in 


28 


Triticum aestivum 


WHTAMYA 


a amylase 








WHTCAB 


CAB 








WHTEMR 


Em protein 








WHTGIR 


gibberellin responsive 










protein 








WHTGLGB 


y gliadin 




50 




WHTGL1ABA 


a/0 gliadin Class All 








WHTGLIJT1 


High MW glutcnin 








WHTH3 


Histone 3 








WHTH4091 


Histone 4 








WHTRBCB 


RuBPC small subunit 






Secale cerealc 


RYESECGSR 


y secalin 




55 


Zea mays 


MZEA1G 


40.1 kD Al • 










protein (NADPH- 










dependent reductase) 








MZEACT1G 


Actin 








MZEADH11F 


Alcohol dehydrogenase 1 








J MZEADH2NR 


Alcohol dehydrogenase 2 




60 




MZEALD 


Aldolase 








MZEANT 


ATP/ADP translocator 








MZEEG2R 


Glutei in 2 








MZEGGST3B 


Glutathione S transferase 








MZEH3C2 


Histone 3 








MZEH4C1 4 


Histone 4 




65 




MZEHSP701 


70 kD Heat shock 










protein, exon 1 





21 
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Frequency of Codon Usage 



MZEHSP702 


70 kD Heat shock 




protein, exon 2 


MZELHCP 


CAB 


MZEMPL3 


Lipid body surface 




protein L3 


MZEPEPCR 


Phosphoenolyruvate 




carboxylase 


MZERBCS 


RuBPC small subunit 


MZESUSYSG 


Sucrose synthetase 


MZETPI2 


Triosep hosp hate 




isomerase 1 


MZEZEA20M 


19 kD zein 


MZEZEA30M 


19 kD zein 


MZEZE15A3 


15 kD zein 


MZEZE16 


16 kD zein 


MZEZE19A 


19 kD zein 


MZEZE22A 


22 kD zein 


MZEZE22B 


22 kD zein 




Catalase 2 29 




Regulatory Cl locus 30 



10 



Bt codons were obtained from analysis of coding sequences of the following 
genes: 

Bt var. kurstaki HD-73, 6.6 kb HindlH fragment (Kronstad et al. (198T) J 
Bacterid. 154:419-428); 

Bt var. kurstaki HD-1, 5.3 kb fragment (Adang et al. (1987) in Biotechnology 
in In%xrtebrate Pathology and Cell Culture, K. Maramorosh (ed), Academic 
Press, Inc. New York, pp. 85-99); 

Bt var. kurstaki HD-1, 4.5 kb fragment (Schnepf and Whiteley (1985) J Biol 
Chem. 260:6273-6280); and 

Bt var. tenebrionis, 3.0 kb HindlH fragment (Sekar et al. (1987) Proc. Natl. 
Acad. Sci. 84:7036-7040). 
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For example, dicols utilize the AAG codon for lysine with 
a frequency of 61% and the AAA codon with a frequency of 
39%. In contrast, in Bt proteins the lysine codons AAG and 
AAA are used with a frequency of 13% and 87%, respec- 
tively. It is known in the art that seldom used codons are 60 
generally detrimental to that system and must be avoided or 
used judiciously. Thus, in designing a synthetic gene encod- 
ing the Btt crystal protein, individual amino acid codons 
found in the original Btt gene are altered to reflect the codons 
preferred by dicot genes for a particular amino acid. 65 
However, attention is given to maintaining the overall 
distribution of codons for each amino acid within the coding 



55 



region of the gene. For example, in the case of alanine, it can 
be seen from Table 1 that the codon GCA is used in Bt 
proteins with a frequency of 50%, whereas the codon OCT 
is the preferred codon in dicot proteins. In designing the 
synthetic Btt gene, not all codons for alanine in the original 
Bt gene are replaced by GCT; instead, only some alanine 
codons are changed to GCT while others are replaced with 
different alanine codons in an attempt to preserve the overall 
distribution of codons for alanine used in dicot proteins. 
Column C in Table 1 documents that this goal is achieved; 
the frequency of codon usage in dicot proteins (column A) 
corresponds very closely to that used in the synthetic Btt 
gene (column Q. 

In similar manner, a synthetic gene coding for insecticidal 
15 crystal protein can be optimized for enhanced expression in 
monocot plants. In Table 1, column D, is presented the 
frequency of codon usage of highly expressed monocot 
proteins. 

Because of the degenerate nature of the genetic code, only 
20 part of the variation contained in a gene is expressed in this 
protein. It is clear that variation between degenerate base 
frequencies is not a neutral phenomenon since systematic 
codon preferences have been reported for bacterial, yeast 
and mammalian genes. Analysis of a large group of plant 
gene sequences indicates that synonymous codons are used 
differently by monocots and dicots. These patterns are also 
distinct from those reported for E. coli, yeast and man. 

In general, the plant codon usage pattern more closely 
resembles that of man and other higher eukaryotes than 
unicellular organisms, due to the overall preference for G+C 
content in codon position HI. Monocots in this sample share 
the most commonly used codon for 13 of 18 amino acids as 
that reported for a sample of human genes (Grantham el al. 
(1986 supra), although dicots favor the most commonly used 
human codon in only 7 of 18 amino acids. 

Discussions of plant codon usage have focused on the 
differences between codon choice in plant nuclear genes and 
in chloroplasts. Chloroplasts differ from higher plants in that 
they encode only 30 tRNA species. Since chloroplasts have 
restricted their tRNA genes, the use of preferred codons by 
chloroplast-encoded proteins appears more extreme. 
However, a positive correlation has been reported between 
the level of isoaccepting tRNA for a given amino acid and 
the frequency with which this codon is used in the chloro- 
plast genome (Pfitzinger et al. (1987) Nucl. Acids Res. 
15:1377-1386). 

Our analysis of the plant genes sample confirms earlier 
reports that the nuclear and chloroplast genomes in plants 
have distinct coding strategies. The codon usage of mono- 
cots in this sample is distinct from chloroplast usage, sharing 
the most commonly used codon for only 1 of 18 amino acids. 
Dicots in this sample share the most commonly used codon 
of chloroplasts in only 4 of 18 amino acids. In general, the 
chloroplast codon profile more closely resembles that of 
unicellular organisms, with a strong bias towards the use of 
A+T in the degenerate third base. 

In unicellular organisms, highly expressed genes use a 
smaller subset of codons than do weakly expressed genes 
although the codons preferred arc distinct in some cases. 
Sharp and Li (1986) Nucl. Acids Res. 14:7734-7749 report 
that codon usage in 165 E. coli genes reveals a positive 
correlation between high expression and increased codon 
bias. Bennetzen and Hall (1982) supra have described a 
similar trend in codon selection in yeast. Codon usage in 
these highly expressed genes correlates with the abundance 
of isoaccepting tRNAs in both yeast and E coli. It has been 
proposed that the good fit of abundant yeast and E. coli 
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mRNA codon usage to isoacceptor tRNA abundance pro- 
motes high translation levels and high steady stale levels of 
these proteins. This strongly suggests that ihe potential for 
high levels of expression of plant genes in yeast or E. colt 
is limiled by their codon usage. Hoekema et al. (1987) supra 5 
report that replacement of the 25 most favored yeast codons 
with rare codons in the 5' end of the highly expressed gene 
PGK1 leads to a decrease in both mRNA and protein. These 
results indicate that codon bias should be emphasized when 
engineering high expression of foreign genes in yeast and 
other systems. 

(iii) Sequences within the Btt Coding Region having Poten- 
tially Destabilizing Influences 

Analysis of the Btt gene reveals that the A+T content 
represents 64% of the DNA base composition of the coding 
region. This level of A+T is about 10% higher than that 15 
found in a typical plant coding region. Most often, high A+T 
regions are found in intergenic regions. Also, many plant 
regulatory sequences are observed to be AT-rich. These 
observations lead to the consideration that an elevated A+T 
content within the Btt coding region may be contributing to 20 
a low expression level in plants. Consequently, in designing 
a synthetic Btt gene, the A+T content is decreased to more 
closely approximate the A+T levels found in plant proteins. 
As illustrated in Table 3, the A+T content is lowered to a 
level in keeping with that found in coding regions of plant ^ 
nuclear genes. The synthetic Btt gene of this invention has 
an A+T content of 55%. 

TABLE 3 



Adenine 


+ Thymine Content in 


Btt Coding Region 


30 






Base 






Codiog Region 


G 


A T 


C % G +C 


% A + T 


Natural Btt gene 
Synthetic J3u gene 


341 
392 


633 514 
530 483 


306 36 
428 45 


64 35 
55 



In addition, the natural Btt gene is scanned for sequences 
that are potentially destabilizing to Btt RNA. These 
sequences, when identified in the original Btt gene, are 
eliminated through modification of nucleotide sequences. 
Included in this group of potentially destabilizing sequences 
are: 

(a) plant polyadenylation signals (as described by Joshi 
(1987) Nucl. Acids Res. 15:9627-9640). In eukaryotes, 
the primary transcripts of nuclear genes are extensively 
processed (steps including 5' -capping, intron splicing, 
polyadenylation) to form mature and translatable 
mRNAs. In higher plants, polyadenylation involves 
endocucleolytic cleavage at the polyAsite followed by 
the addition of several A residues to the cleaved end. 
The selection of the polyA site is presumed to be 
cis-regulated. During expression of Bt protein and RNA 
in different plants, the present inventors have observed 
that the polyadenylated mRNA Isolated from these 
expression systems is not full-length but instead is 
truncated or degraded. Hence, in the present invention 
it was decided to minimize possible destabilization of 
RNA through elimination of potential polyadenylation 
signals within the coding region of the synthetic Btt 
gene. Plant polyadenylation signals including 
AATAAA, AATGAA, AATAAT, AATATT, GATAAA, 
GATAAA, and AATAAG motifs do not appear in the 
synthetic Btt gene when scanned for 0 mismatches of 
the sequences. 

(b) polymerase II termination sequence, CAN 7 . 9 AGT- 
NNAA. This sequence was shown ( Vankan and Filip- 



owicz (1988) EMBO J. 7:791-799) to be next to the 3' 
end of the coding region of the U2 snRNA genes of 
Arabidopsis thaliana and is believed to be important 
for transcription termination upon 3 f end processing. 
The synthetic Btt gene is devoid of this termination 
sequence. 

(c) CUUCGG hairpins, responsible for extraordinarily 
stable RNA secondary structures associated with vari- 
ous biochemical processes (Tuerk et al. (1988) Proc. 
Natl. Acad. Sci. 85:1364-1368). The exceptional sta- 
bility of CUUCGG hairpins suggests that they have an 
unusual structure and may function in organizing the 
proper folding of complex RNA structures. CUUCGG 
hairpin sequences are not found with either 0 or 1 
mismatches in the Btt coding region. 

(d) plant consensus splice sites, 5'=AAG: GTAAGT(S EQ 
ID NO:6) and 3'=TTTT(Pu)TTT(Pu)T(Pu)T(Pu)T(Pu) 
TGCAG:C(SEQ ID NO:7), as described by Brown et 
al. (1986) EMBO J. 5:2749-2758. Consensus 
sequences for the 5' and 3' splice junctions have been 
derived from 20 and 30 plant intron sequences, respec- 
tively. Although it is not likely that such potential splice 
sequences are present in Bt genes, a search was initiated 
for sequences resembling plant consensus splice sites in 
the synthetic Btt gene. For the 5' splice site, the closest 
match was with three mismatches. This gave 12 
sequences of which two had G:GT. Only position 948 
was changed because 1323 has the Kpnl site needed for 
reconstruction. The 3'-splice site is not found in the 
synthetic Btt gene. 

Thus, by highlighting potential RNA-destabilizing 
sequences, the synthetic Btt gene is designed to eliminate 
known eukaryotic regulatory sequences that affect RNA 
synthesis and processing. 

Example 2 

Chemical Synthesis of a Modified Btt Structural 
Gene 

40 (i) Synthesis Strategy 

The general plant for synthesizing linear double-stranded 
DNA sequences coding for the crystal protein from Btt is 
schematically simplified in FIG. 2. The optimized DNA 
coding sequence (FIGS. 1A-C) is divided into thirteen 

45 segments (segments A-M) to be synthesized individually, 
isolated and purified. As shown in FIG. 2, the general 
strategy begins by enzymatically joining segments A and M 
to form segments AM to which is added segment BL to form 
segment ABLM. Segment CK is then added enzymatically 

50 to make segment ABCKLM which is enlarged through 
addition of segments DJ, EI and RFH sequentially to give 
finally the total segment ABCDEFGHIJKLM, representing 
the entire coding region of the Btt gene. 

FIG. 3 outlines in more detail the strategy used in cora- 

55 bining individual DNA segments in order to effect the 
synthesis of a gene having unique restriction sites integrated 
into a defined nucleotide sequence. Each of the thirteen 
segments (A to M) has unique restriction sites at both ends, 
allowing the segment to be strategically spliced into a 

60 growing DNA polymer. Also, unique sites are placed at each 
end of the gene to enable easy transfer from one vector to 
another. 

The thirteen segments (A to N) used to construct ihe 
synthetic gene vary in size, oligonucleotide pairs of approxi- 
65 mately 75 nucleotides each are used to construct larger 
segments having approximately 225 nucleotide pairs. FIG. 3 
documents the number of base pairs contained within each 
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segment and specifies the unique restriction sites bordering 65° C, solutions are combined, cooled slowly to room 
each segment. Also, the overall strategy to incorporate temperature and kept on ice until used. The ligatcd mixture 
specific segments at appropriate splice sites is detailed in may be purificd by electrophoresis through a 4% NuSieve 

(ii) Preparation of Oligodeoxy nucleotides 5 ^° SC (FM 9 f e !; ™?> nd co ^ponding to the ligaled 

Preparation of oligodeoxynucleotides for use in the syn- d «P tex * eXCISCd ' lhe DNAls extracted frora the a & arose and 

thesis of a DNA sequence comprising a gene for Btt is elhano1 P rec, P ltaled 

carried out according to the general procedures described by Ligations are carried out as exemplified by that used in M 

Matteucci et al. (1981) J. Am. Chem. Soc. 103:3185-3192 segment ligations. M segment DNA is brought to 65° C. for 

and Beaucage et al. (1981) Tetrahedron Lett. 22:1859-1862. 10 25 rain, the desired vector is added and the reaction mixture 

Ail oligonucleotides are prepared by the solid-phase phos- is incubated at 65° G for 15 min. The reaction is slow cooled 

phoramidite triester coupling approach, using an Applied over IVi hours to room temperature. ATP to 05 mM and 3.5 

Biosystems Model 380A DNA synthesizer. Deprotection units of T4 DNA ligase salts are added and the reaction 

and cleavage of the oligomers from the solid support are mixture is incubated for 2 hr at room temperature and then 

earned out according to standard procedures. Crude oligo- 15 maintained overnight at 15° C. The next morning, vectors 

nucleotide mixtures are purified using an oligonucleotide which had not been ligated to M block DNA were removed 

H^5h!H° n K "2 [ n g % (0T , C ' i AP ^ o C 0 l > as upon linearization by EcoRI digestion. Vectors ligated to the 

described by McBride et al. (1988) Biotechniques M DNA are ^ t0 \ ansform E . C J MC1061 

' S'-phosphorylation of oligonucleotides is performed with 20 u£^T^^u^ "H? f"? * ^ ^ 

T4 polynucleotide kinase. Hie reaction contains 2 « oli- h y bndlzatl ° D with P-labelled ohgonucleotide probes. The 

gonucleotide and 18.2 units polynucleotide kinase s ^ uen f °f *** DNA segment is confirmed by isolating 

(Pharmacia) in linker kinase buffer (Maniatis (1982) Clon- P lasmid DNA and sequencing using the dideoxy method of 

ing Manual, Fritsch and Sambrook (eds.), Cold Spring Sanger et aL < 1977 > ^ NatL Acad - Sci 74:5463-5467. 

Harbor Laboratory, Cold Spring Harbor, N.Y.). The reaction ^ P") Synthesis of Segment AM 

is incubated at 37° C. for 1 hour. Three oligonucleotide pairs (Al and its complementary 

Oligonucleotides are annealed by first heating to 95° C. strand Ale, A2 and A2c and A3 and A3c) are assembled and 

for 5 min. and then allowing complementary pairs to cool ligated as described above to make up segment A The 

slowly to room temperature. Annealed pairs are reheated to nucleotide sequence of segment A is as follows: 



TABLE 4 



Nucleotide Sequence of Segment A 



BamHI 

XhoII Petl 
I I 
AATTGGGATC CAAC AATGGC TGCAGACAAC AACACGGAGGCC CT CG AT AGC TCTAC CACC 



1 



60 



CCCTAGGTTGTTACCGACGTCTGTTGTTGTGCCTCCGGGAGCTATCGAGATGGTGG 
EcoRI end 

MAADNNTEALDSSTT 



AAAGATCTCATlrCAGAAGGGCATCTCCGTTGTGGGTGATCTCCTTGGCGTTGTTGGTTTC 
61 +J + + + + 120 

TTTCTACAGTAAGTCTTCCOpTAGAGGCAACACCCACTAGAGGAACCGCAAC^CCAAAG 
KDVIQKG ISVVGDLLGVVGF 

BspXII 
Ban I I 

1 1 . 

CCCTrTGGTGGTGCCCTTGTTTCGTTfcTACACTAACTTTCTGAATACTATTTGGCCCAGC 

191 t- . j. I i j_ . 



Al (71 bases) 
Ale* (76 bases) 



A2 (75 bases) 
A2c (76 bases) 



GGGAAACCACCACGGGAACAAAGCAAGATGTGATT^AAGACTTATGATAAACCGGGTCG 
PFGGALVSFYTNFLNTIWPS 

XhoII A3 (82 bases) 

Bglll EcoRI 

Hindlll I Xbal I A3c (76 bases) 

I I I I end 
G AAGACCCTTGGAAGGCTTTT ATGGAGCAAGTGGAAGCTTAGATC TAG 
181 + + + + + 232 

CTTCTGGGAACCTTCCG AAAATACCTCGTTC AC CTTC GAATCTAGATCTTAA 
EDPWKAFMEQVE 
*c " complementary strand. 
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In Table 4, bold lines demarcate the individual oligonucle- 
otides. Fragment Al contains 71 bases, Ale has 76 bases, A2 
has 75 bases, A2c has 76 bases, A3 has 82 bases and A3c has 
76 bases. In all, segment A is composed of 228 base pairs 
and is contained between EcoRI restriction enzyme site and 
one destroyed EcoRI site (5')J. (Additional restriction sites 
within Segment A are indicated.) The EcoRI single-stranded 
cohesive ends allow segment A to be annealed and then 
ligated to the EcoRI-cut cloning vector, pIC20K. 

Segment M comprises three oligonucleotide pairs: Ml, 80 
bases, Mlc, 86 bases, M2, 87 bases, M2c, 87 bases, M3, 85 
bases and M3c 79 bases. The individual oligonucleotides are 
annealed and ligated according to standard procedures as 
described above. The overall nucleotide sequence of seg- 
ment M is: 



10 
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As proposed in FIG. 3, segment M is joined to segment A 
in the plasmid in which it is contained. Segment M is excised 
at the flanking restrictions sites from its cloning vector and 
spliced into pIC2()K, harboring segment A, through succes- 
sive digestions with Hindlll followed by Bglll. The pIC20K 
vector now comprises segment A joined to segment M with 
a Hindlll site at the splice site (see FIG. 3). Plasmid pIC20K 
is derived from pIC20R by removing the Scal-Ndel DNA 
fragment and inserting a Hindi fragment containing an 
NPTI coding region. The resulting plasmid of 4.44 kb 
confers resistance to kanamycin on E. colt. 

Example 3 

Expression of Synthetic Crystal Protein Gene in 
Bacterial Systems 
The synthetic Btt gene is designed so that it is expressed 
in the pIC20R-kan vector in which it is constructed. This 



TABLE 5 



Nucleotide Sequence of Segment M 



Hindlll BspXII 
Eco I Banlll 

Rl-end | | | 

AATTAAGCTTGGACGGGGCTCCATTCAACCAATACTACTTCGATAAGACCATCAACAAAG 
1 + + + + + + 

TTCGAACCTGCCCCGAGGTAAGTTGGTTATGATGAAGCTATTCTGGTAGTTGTTTC 



61 



DGAPFNQYYFDKTINKG 



Asull 

■ I 

GAGACACACTCACGTATAATtrCCTTCAACTTAGCCAGCTTCAGCACTCCATTCGAATTCT 



CTCTGTGTGAGTGCATATTAAGGAAGTTGJ^TCGGTCGAAGTCGTGAGGTAAGCTTAACA 
DTLTYNSFNLASFSTPFELS 



AccI 

Ahall TthI I 

i il 

CAGGGAACAACTTGCAGATAGGCGTCACAGGATTGAGTGCTGGTGACf^GGTCTACATCG 

+ + + + T i + 

GTCCCTTGTTG AACGTCT ATC CGCAGTGTCC TAACTC ACGACC AC TGTTCCAGATGTRGC 
GNNLQIGVTGLSAGDKVYID 



120 



Ml (80 bases) 
Mlc* (86 bases) 



M2 (87 bases) 
M2c (87 bases) 



Mstll 
I 

ACAAGATTGAGTTCATTCCAGTGAACCTTAGGTCCCCAGGAACCGAGCTTGAGTTCATCG 

181 4 + + + + + 240 

TGTTCTAAC TC AAGT AAGGTC AC TTGGAATCCAGGGGTCCTTGGCTCGAACTC AAGT AGC 



P V N 



SPGTELEFI 



Bglll 
XhoII 
Xbal I 
I I 
ACATC TAGATC T 

241 + 2S6 

TGTAGATCTAGATTAA 



M3 (85 bases) 
M3c (79 bases) 



252 BASES (TOTAL) 



*c « complementary strand 

In Table 5 bold lines demarcate the individual oligonucle- 
otides. Segment M contains 252 base pairs and has 
destroyed EcoRI, restriction sites at both ends. (Additional 
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expression is produced utilizing the initiation methionine of 
the lacZ protein of pIC20K. The wild-type Btt crystal protein 
sequence expressed in this manner has full insecticidal 

restriction sites within segmen. Mare indicated). Segment 6S « ad * Xi ° a ' tl, ° synthetic gene is designed to 

XA • ■ , . »™«« ^ . . contain a BamH I site 5' proximal to the initia ting methionine 

M is inserted into vector pIC20R at an EcoRI restriction site C odon and a Bglll site 3' to the terminal TAG translation stop 

and cloned. codon. This facilitates the cloning of the insecticidal crystal 
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protein coding region into bacterial expression vectors such 
as pDR540 (Russell and Bennett, 1982). Plasrnid pDR540 
contains the TAC promoter which allows the production of 
proteins including Bti crystal protein under controlled con- 
ditions in amounts up to 10% of the total bacterial protein, s 
This promoter functions in many gram-negative bacteria 
including E. coli and Pseudomonas. 

Production of Bt insecticidal crystal protein from the 
synthetic gene in bacteria demonstrates that the protein 
produced has the expected toxicity to coleopteran insects. 10 
These recombinant bacterial strains in themselves have 
potential value as microbial insecticides, product of the 
synthetic gene. 

Example 4 15 

Expression of a Synthetic Crystal Protein Gene in 
Plants 

The synthetic Btt crystal protein gene is designed to ^ 
facilitate cloning into the expression cassettes. These utilize 
sites compatible with the BamHI and BgUI restriction sites 
flanking the synthetic gene. Cassettes are available that 
utilize plant promoters including CaMV 35S, CaMV 19S 
and the ORF 24 promoter from T-DNA. These cassettes 
provide the recognition signals essential for expression of ~ 
proteins in plants. These cassettes are utilized in the micro 
Ti plasmids such as pH575. Plasmids such as pH575 con- 



taining the synthetic Btt gene directed by plant expression 
signals are utilized in disarmed Agrobacteriwn tiuttefaciens 
to introduce the synthetic gene into plant genomic DNA. 
This system has been described previously by Adang et al. 
(1987) to express Bt var. kurstaki crystal protein gene in 
tobacco plants. These tobacco plants were toxic to feeding 
tobacco hornworms. 

Example 5 
Assay for Insecticidal Activity 

Bioassays are conducted essentially as described by 
Sekar, V. et al. supra. Toxicity is assessed by an estimate of 
the LD 50 . Plasmids are grown in E. coli JM105 (Yanisch- 
Perron, C. et al. (1985) Gene 33:103-119). On a molar basis, 
no significant differences in toxicity are observed between 
crystal proteins encoded by p544Pst-Met5, p544-HindIII, 
and pNSBP544. When expressed in plants under identical 
conditions, cells containing protein encoded by the synthetic 
gene are observed to be more toxic than those containing 
protein encoded by the native Btt gene. Immunoblots 
("western" blots) of cell cultures indicated that those that are 
more toxic have more crystal protein antigen. Improved 
expression of the synthetic Btt gene relative to that of a 
natural Btt gene is seen as the ability to quantitate specific 
mRNA transcripts from expression of synthetic Btt genes on 
Northern blot assays. 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(iii) NUMBER OF SEQUENCES: 9 

(2) INFORMATION FOR SEQ ID NO":l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATGACTGCAG ATAATAATAC GGAAGCACTA GATAGCTCTA CAACAAAAGA TGTCATTCAA 60 

AAAGGCATTT CCGTAGTAGG TGATCTCCTA GGCGTAGTAG GTTTCCCGTT TGGTGGAGCG 120* 

CTTGTTTCGT TTTATACAAA CTTTTTAAAT AC TATTTGGC CAAGTGAAGA CCCGTGGAAG 180 

GCTTTTATGG AACAAGTAGA AGCATTGATG GATCAGAAAA TAGCTGATTA TGCAAAAAAT 240 

AAAGCTCTTG CAGAGTTACA GGGCCTTCAA AATAATGTCG AAGATTATGT GAGTGCATTG 300 

AGTTCATGGC AAAAAAATCC TGTGAGTTCA CGAAATCCAC ATAGCCAGGG GCGGATAAGA 360 

GAGCTGTTTT CTCAAGCAGA AAGTCATTTT CGTAATTCAA TGCCTTCGTT TGCAATTTCT 420 

GGATACGAGG TTCTATTTC T AACAACATAT GCACAAGCTG CCAACACACA TTTATTTTTA 4 80 

CTAAAAGACG CTCAAATTTA TGGAGAAGAA TGGGGATACG AAAAAGAAGA TATTGCTGAA 5 40 

TTTTATAAAA GACAACTAAA ACTTACGCAA GAATATACTG ACCATTGTGT CAAATGGTAT 600 

AATGTTGGAT TAGATAAATT AAGAGGTTCA TC TT ATGAAT CTTGGGTAAA CTTTAACCGT 660 

TATCGCAGAG AGATGACATT AACAGTATTA GATTTAATTG CACTATTTCC ATTGTATGAT 720 
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GTTCGGCTAT ACCCAAAAGA AGTTAAAACC GAATTAACAA GAGACGTTTT AACAGATCCA 780 

ATTGTCGGAG TCAACAACCT TAGGGGCTAT GGAACAACCT TCTCTAATAT AGAAAATTAT 840 

ATTCGAAAAC CACATCTATT TGACTATCTG CATAGAATTC AATTTCACAC GCGGTTCCAA 900 

CCAGGATATT ATGGAAATGA CTCTTTCAAT TATTGGTCCG GTAATTATGT TTCAACTAGA 960 

CCAAGCATAG GATCAAATGA TATAATCACA TCTCCATTCT ATGGAAATAA ATCCAGTGAA 1020 

CCTGTACAAA ATTTAGAATT TAATGGAGAA AAAGTCTATA GAGCCGTAGC AAATACAAAT 1080 

CTTGCGGTCT GGCCGTCCGC TGTATATTCA GGTGTTACAA AAGTGGAATT TAGCCAATAT X14 0 

AATGATCAAA CAGATGAAGC AAGTACACAA ACGTACGACT CAAAAAGAAA TGTTGGCGCG 1200 

GTCAGCTGGG ATTCTATCGA TCAATTGCCT CCAGAAACAA CAGATGAAGC TCTAGAAAAG 1260 

GGATATAGCC ATCAACTCAA TTATGTAATG TGCTTTTTAA TGCAGGGTAG TAGAGGAACA 1320 

ATCCCAGTGT TAACTTGGAC ACATAAAAGT GTAGACTTTT TTAACATGAT TGATTCGAAA ,1380 

AAAATTACAC AACTTCCGTT AGTAAAGGCA TATAAGTTAC AATCTGGTGC TTCCGTTGTC 1440 

GCAGGTCCTA GGTTTACAGG AGGAGATATC ATTCAATGCA CAGAAAATGG AAGTGCGGCA 1500 

ACTATTTACG TTACACCGGA TGTGTCGTAC TCTCAAAAAT ATCGAGCTAG AATTCATTAT 1560 

GCTTCTACAT CTCAGATAAC ATTTACACTC AGTTTAGACG GGGCACCATT TAATCAATAC 1620 

TATTTCGATA AAACGATAAA TAAAGGAGAC ACATTAACGT ATAATTCATT TAATTTAGCA 1680 

AGTTTCAGCA CACCATTCGA ATTATCAGGG AATAACTTAC AAATAGGCGT CACAGGATTA 1740 

AGTGCTGGAG ATAAAGTTTA TATAGACAAA ATTGAATTTA TTCCAGTGAA TTAA 1794 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear ' 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys 
15 10 15 

Asp Val He Gin Lye Gly He Ser Val Val Gly Asp Leu Leu Gly Val 
20 25 30 

Val Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe 
35 40 45 

Leu Asn Thr He Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu 
50 55 60 

Gin Val Glu Ala Leu Met Asp Gin Lys He Ala Asp Tyr Ala Lys Asn 
65 70 75 80 

Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Val Glu Asp Tyr 
85 90 95 

Val Ser Ala Leu Ser Ser Trp Gin Lys Asn Pro Val Ser Ser Arg Asn 
100 105 110 

Pro His ser Gin Gly Arg He Arg Glu Leu Phe Ser Gin Ala Glu Ser 
115 120 125 

His Phe Arg Asn Ser Met Pro Ser Phe Ala He Ser Gly Tyr Glu Val 
130 135 140 

Leu Phe Leu Thr Thr Tyr Ala Gin Ala Ala Asn Thr Hio Leu Phe Leu 
145 150 155 160 
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Leu Lys Asp Alo Gin He Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu 
165 170 175 

Asp lie Ala Glu Phe Tyr Lys Arg Gin Leu Lys Leu Thr Gin Glu Tyr 
180 185 190 

Thr Asp His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg 
195 200 205 

Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu 
2L0 215 220 

Met Thr Leu Thr Val Leu Asp Leu He Ala Leu Phe Pro Leu Tyr Asp 
225 230 235 240 

Val Arg Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val 
245 250 255 

Leu Thr Asp Pro He Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr 
260 265 270 

Thr Phe Ser Asn He Glu Asn Tyr He Arg Lys Pro His Leu Phe Asp 
275 280 285 

Tyr Leu His Arg He Gin Phe His Thr Arg Phe Gin Pro Gly Tyr Tyr 
290 295 300 

Gly Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg 
305 310 315 320 

Pro Ser lie Gly Ser Asn Asp He He Thr Ser Pro Phe Tyr Gly Asn 
325 330 335 

Lys Ser Ser Glu Pro Val Gin Asn Leu Glu Phe Asn Gly Glu Lys Val 
340 345 350 

Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val 
355 360 365 

Tyr Ser Gly Val Thr Lys Val Glu Phe Ser Gin Tyr Asn Asp Gin Thr 
370 375 380 

Asp Glu Ala Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala 
385 390 * 395 400 

Val Ser Trp Asp Ser He Asp Gin Leu Pro Pro Glu Thr Thr Asp Glu 
405 410 415 

Pro Leu Glu Lys Gly Tyr Ser His Gin Leu Asn Tyr Val Met Cys Phe 
420 425 430 

Leu Met Gin Gly Ser Arg Gly Thr He Pro Val Leu Thr Trp Thr His 
435 440 445 

Lys Ser Val Asp Phe Phe Asn Met He Asp Ser Lys Lys He Thr Gin 
450 455 460 

Leu Pro Leu Val Lys Ala Tyr Lys Leu Gin Ser Gly Ala Ser Val Val 
465 470 475 480 

Ala Gly Pro Arg Phe Thr Gly Gly Asp He He Gin Cys Thr Glu Asn 
485 490 495 

Gly Ser Ala Ala Thr He Tyr Val Thr Pro Asp Val Ser Tyr Ser Gin 
500 505 510 

Lys Tyr Arg Ala Arg He His Tyr Ala Ser Thr Ser Gin He Thr Phe 
515 520 525 

Thr Leu Ser Leu Asp Gly Ala Pro Phe Asn Gin Tyr Tyr Phe Asp Lys 
530 535 540 

Thr He Asn Lys Gly Asp Thr Leu Thr Tyr Asn Ser Phe Asn Leu Ala 
545 550 555 560 

Ser Phe Ser Thr Pro Phe Glu Leu Ser Gly Aon Asn Leu Gin He Gly 
565 570 575 
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Vol Thr Gly Leu Ser Ala Gly Asp Lys Val Tyr He Asp Lys He Glu 
580 585 590 

Phe He Pro Val Asn 

595 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1B33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE : cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

ATGGCTGCAG ACAACAACAC GGAGGCCCTC GATAGCTCTA CCACCAAAGA TGTCATTCAG 60 

AAGGGCATCT CCGTTGTGGG TGATCTCCTT GGCGTTGTTG GTTTCCCCTT TGGTGGTGCC 120 

CTTGTTTCGT TCTACACTAA CTTTCTGAAT ACTATTTGGC CCAGCGAAGA CCCTTGGAAG 180 

GCTTTTATGG AGCAAGTGGA AGCTTTGATG GATCAGAAGA TCGCTGATTA TGCAAAGAAC 240 

AAAGCTCTTG CTGAGCTCCA GGGCCTTCAG AACAACGTCG AAGATTATGT GAGTGCACTG 300 

AGTTCATGGC AAAAGAATCC TGTGTCCTCA CGAAATCCAC ATAGCCAGGG GCGCATAAGG 360 

GAGCTGTTCT CTCAAGCAGA AAGTCACTTC CGGAATTCAA TGCCTTCCTT TGCCATCTCT 420 

GGGTACGAGG TTCTCTTTCT TACAACCTAC GCTCAAGCTG CCAACACACA TCTGTTCTTA 480 

CTAAAAGACG CTCAAATCTA TGGTGAAGAA TGGGGATACG AGAAAGAAGA TATCGCTGAG 540 

TTCTACAAGC GTCAACTAAA ACTTACTCAA GAGTATACTG ACCACTGTGT CAAATGGTAT 600 

AATGTTGGAT TGGATAAGTT GAGAGGTTCA TCTTATGAAT CTTGGGTAAA CTTTAACCGG 660 

TACCGCAGAG AGATGACATT GACAGTGCTC GACTTGATTG CACTATTTCC ATTGTATGAT 720 

GTTCGACTCT ACCCAAAGGA GGTTAAAACC GAATTGACTA GAGACGTTTT AACCGATCCC 780 

ATTGTC GG AG TCAACAACCT CAGAGGCTAC GGAACAACCT TCTCTAACAT AGAAAAC TAC 840 

ATTCGTAAAC CACATCTATT CGACTATCTG CACAGAATTC AGTTTCACAC GCGGTTCCAA 900 

CCAGGATACT ATGGAAATGA CTCTTTCAAC TATTGGTCCG GTAATTATGT TTCAACTAGA 960 

CCCAGCATAG GATCTAATGA CATCATCACC TCTCCATTCT ACGGAAACAA GTCCTCCGAG 1020 

CCTGTGCAAA ACTTGGAGTT TAATGGAGAG AAAGTCTATA GAGCCGTGGC CAATACCAAT 1080 

CTTGCCGTCT GGCCGTCCGC TGTGTACTCA GGTGTTACCA AAGTGGAATT CAGCCAATAC 1140 

AATGATCAGA CAGATGAAGC AAGTACTCAA ACTTACGACT CAAAGAGGAA TGTTGGCGCG 1200 

GTCAGCTGGG ATTCTATCGA TCAACTCCCT CCAGAAACCA CCGATGAACC TCTAGAGAAG 1260 

GGTTATAGCC ATCAACTCAA TTACGTAATG TGCTTTCTCA TGCAGGGTAG TAGAGGTACC 1320 

ATCCCAGTGT TAACTTGGAC TCACAAGAGT GTAGACTTCT TCAACATGAT TGATTCGAAA 1380 

AAGATTACTC AACTTCCGTT GGTAAAGGCC TACAAGTTAC AATCTGGTGC TTCCGTTGTC 1440 

GCAGGTCCTA GGTTTACAGG AGGAGATATC ATTCAATGCA CTGAGAATGG GTCCGCGGCA 1500 

ACTATCTACG TTACACCTGA TGTGTCGTAC TCTCAAAAGT ATCGTGCTAG AATTCATTAT 1560 

GCTTCTACCT CTCAGATAAC ATTCACACTA AGCTTGGACG GGGCTCCATT CAACCAATAC 1620 

TACTTCGATA AGACCATCAA CAAAGGAGAC ACACTCACGT ATAATTCATT CAACTTAGCC 1680 

AGCTTCAGCA CTCCATTCGA ATTGTC AG GG AACAACTTGC AGATAGGCGT CACAGGATTG 174 0 

AGTGCTGGTG ACAAGGTTTA CATCGACAAG ATTGAGTTCA TTCCAGTGAA CCTTAGGTCC 1800 
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CCAGGAACCG AGCTTGAGTT CATCGACATC TAG 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Ala Ala Asp Aon Aan Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys 
15 10 15 

Asp Vttl He Gin Lys Gly He Ser Val Val Gly Asp Leu Leu Gly Val 
20 25 30 

Val Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe 
35 40 45 

Leu Asn Thr He Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu 
50 55 60 

Gin Val Glu Ala Leu Met Asp Gin Lys He Ala Asp Tyr Ala Lys Asn 
65 70 75 80 

Lyo Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Val Glu Asp Tyr 
85 90 95 

Val Ser Ala Leu Ser Ser Trp Gin Lys Asn Pro Val Ser Ser Arg Asn 
100 105 no 

Pro His Ser Gin Gly Arg He Arg Glu Leu Phe Ser Gin Ala Glu Ser 
115 120 125 

His Phe Arg Asn Ser Met Pro Ser Phe Ala He Ser Gly Tyr Glu Val 
130 135 140 

Leu Phe Leu Thr Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Phe Leu 
145 150 * 155 160 

Leu Lys Asp Ala Gin He Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu 
165 170 175 

Asp He Ala Glu Phe Tyr Lys Arg Gin Leu Lys Leu Thr Gin Glu Tyr 
ISO 185 190 

Thr Asp His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg 
195 200 205 

Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu 
210 215 220 



Met Thr Leu Thr Val Leu Asp Leu 
225 230 

Val Arg Leu Tyr Pro Lyo Glu Val 
245 

Leu Thr Asp Pro He Val Gly Val 
260 

Thr Phe Ser Asn He Glu Asn Tyr 
275 280 



He Ala Leu Phe Pro Leu Tyr Asp 
235 240 

Lys Thr Glu Leu Thr Arg Asp Val 
250 255 

Asn Asn Leu Arg Gly Tyr Gly Thr 
265 270 

He Arg Lys Pro His Leu Phe Asp 
285 



Tyr Leu His Arg He Gin Phe His Thr Arg Phe Gin Pro Gly Tyr Tyr 
290 295 300 

Gly Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg 
305 310 315 320 

Pro Ser He Gly Ser Asn Asp He He Thr Ser Pro Phe Tyr Gly Asn 
325 330 335 
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Lys Ser Ser Glu Pro Vol Gin Asn Leu Glu Phe Asn Gly Glu Lys Val 
34 0 345 350 

Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val 
355 360 365 

Tyr Ser Gly Val Thr Lys Val Glu Phe Ser Gin Tyr Asn Asp Gin Thr 
37 ° 375 380 

Asp Glu Ala Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Val Glv Ala 
385 390 395 400 

Val Ser Trp Asp Ser lie Asp Gin Leu Pro Pro Glu Thr Thr Asp Glu 
405 410 4 l5 

Pro Leu Glu Lys Gly Tyr Ser His Gin Leu Asn Tyr Val Met Cys Phe 
4 20 425 430 

Leu Met Gin Gly Ser Arg Gly Thr He Pro Val Leu Thr Trp Thr His 
435 440 445 

Lys Ser Val Asp Phe Phe Asn Met He Asp Ser Lys Lys He Thr Gin 
450 455 460 

Leu Pro Leu Val Lys Ala Tyr Lys Leu Gin Ser Gly Ala Ser Val Val 
465 470 475 480 

Ala Gly Pro Arg Phe Thr Gly Gly Asp He He Gin Cys Thr Glu Asn 
485 490 495 

Gly Ser Ala Ala Thr He Tyr Val Thr Pro Asp Val Ser Tyr Ser Gin 
500 505 sio 

Lys Tyr Arg Ala Arg He His Tyr Ala Ser Thr Ser Gin He Thr Phe 
515 520 525 

Thr Leu Ser Leu Asp Gly Ala Pro Phe Asn Gin Tyr Tyr Phe Asp Lys 
530 535 540 

Thr He Asn Lys Gly Asp Thr Leu Thr Tyr Asn Ser Phe Asn Leu Ala 
545 550 555 560 

Ser Phe Ser Thr Pro Phe Glu Leu Ser Gly Asn Asn Leu Gin He Gly 
565 ' 570 575 

Val Thr Gly Leu Ser Ala Gly Asp Lys Val Tyr He Asp Lye He Glu 
580 585 590 

Phe He Pro Val Asn Leu Arg Ser Pro Gly Thr Glu Leu Glu Phe He 
595 600 605 

Asp He 
610 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS s double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 
CAGGATCCAA CAATGAC 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO:6: 
AAGGTAAGT 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
TTTTUTTTUT U TU TUT GC AG C 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8r 
AATTGGGATC CAACAATGGC TGCAGACAAC AACACGGAGG 
CCCTAGGTTG TTACCGACGT CTGTTGTTGT GCCTCCGGGA 
ATGTCATTCA GAAGGGCATC TCCGTTGTGG GTGATCTCCT 
TACAGTAAGT CTTCCCGTAG AGGCAACACC CACTAGAGGA 
TTGGTGGTGC CCTTGTTTCG TTCTACACTA ACTTTCTGAA 
AACCACCACG GGAACAAAGC AAGATGTGAT TGAAAGACTT 
ACCCTTGGAA GGCTTTTATG GAGCAAGTGG AAGCTTAGAT 
GAAAATACCT CGTTCACCTT CGAATCTAGA TCTTAA 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
AATTAAGCTT GGACGGGGCT CCATTCAACC AATACTACTT 
TTCGAACCTG CCCCGAGGTA AGTTGGTTAT GATGAAGCTA 
CACACTCACG TATAATTCCT TCAACTTAGC CAGCTTCAGC 
GTGTGAGTGC ATATTAAGGA AGTTGAATCG GTCGAAGTCG 
GAACAACTTG CAGATAGGCG TCACAGGATT GAGTGCTGGT 
CTTGTTGAAC GTCTATCCGC AGTGTCCTAA CTCACGACCA 
GATTGAGTTC ATTCCAGTGA ACCTTAGGTC CCCAGGAACC 
CTAACTCAAG TAAGGTCACT TGGAATCCAG GGGTCCTTGG 
CTAGATCTTG TAG ATC TAGA TTAA 



CCCTCGATAG CTCTACCACC 
GCTATCGAGA TGGTGGAAAG 
TGGCGTTGTT GGTTTCTTTC 
ACCGCAACAA CCAAAGCCCT 
TACTATTTGG CCCAGCGGGA 
ATGATAAACC GGGTCGGAAG 
CTAGCTTCTG GGAACCTTCC 



CGATAAGACC ATCAACAAAG 
TTCTGGTAGT TGTTTCGAGA 
ACTCC ATTC G AATTGTCTCT 
TGAGGTAAGC TTAACACAGG 
GACAAGGTCT ACATCGGTCC 
CTGTTCCAGA TGTAGCACAA 
GAGCTTGAGT TCATCGTGTT 
CTCGAACTCA AGTAGCACAT ■ 



21 



60 
120 
180 
240 
300 
360 
420 
456 



60 
120 
180 
240 
300 
360 
420 
480 
504 
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We claim: 

1. A synthetic Bacillus thuringiensis (B.t.) gene which Ls 
expressed in descendant plant ceDs and encodes a pesticidal 
protein toxin, wherein said synthetic B.t. gene is produced 
by the process of: 5 

selecting a B.t. pesticidal protein toxin desired to be 
expressed in a plant cell; 

obtaining a table indicating codon usage bias for a gene or 
genes more highly expressed in a plant cell than a 
native B.t. gene; 30 

using said table to design a modified coding sequence 
which encodes said protein toxin, whereby said modi- 
fied coding sequence has a frequency of codon usage 
that more closely resembles the frequency of codon 
usage of the plant cell in which it is to be expressed than 15 
did the native B.r. coding sequence encoding said 
protein toxin, said modified coding sequence having at 
least about 10% of the nucleotides changed as com- 
pared to the native B.t. coding sequence; 

obtaining a synthetic B.t. gene comprising a coding region 
comprising said modified coding sequence wherein 
said coding region is under the control of a plant- 
expressible promoter; 

introducing said synthetic B.t. gene into a plant cell; 25 

culturing said cell to obtain descendant plant cells or 
plants comprising descendant plant cells, said descen- 
dant plant cells comprising said synthetic B.t. gene; and 

establishing that said synthetic B.t. gene is expressed in 
said descendant plant cells. 30 

2. The synthetic Bacillus thuringiensis gene of claim 1, 
comprising an A+T base content of less than about 60%. 

3. The synthetic Bacillus thuringiensis gene of claim 1, 
wherein the process of producing said synthetic gene con- 
tains the additional step of determining the frequency of 35 
codon usage of said modified coding sequence. 
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4. A method of designing a synthetic Bacillus thuringien- 
sis (B.t.) gene which is expressed in descendant plant cells, 
comprising the steps of: 

selecting a B.t. pesticidal protein toxiu desired to be 

expressed in a plant cell; 
obtaining a table indicating codon usage bias for a gene or 
genes more highly expressed in a plant cell than a 
native B.t. gene; 

using said table to design a modified coding sequence 
which encodes said protein toxin, whereby said modi- 
fied coding sequence has a frequency of codon usage 
that more closely resembles the frequency of codon 
usage of the plant cell in which it is to be expressed than 
did the native B.t. coding sequence encoding said 
protein toxin, said modified coding sequence having at 
least about 10% of the nucleotides changed as com- 
pared to the native B.t. coding sequence; 
obtaining a synthetic B.t. gene comprising a coding region 
comprising said modified coding sequence wherein 
said coding region is under the control of a plant- 
expressible promoter; 
introducing said synthetic B.t. gene into a plant cell; 
culturing said cell to obtain descendant plant cells, said 
descendant plant cells comprising said synthetic B.t. 
gene; and 

establishing that said synthetic B.t. gene is expressed in 
said descendant plant cells. 

5. The method of claim 4, wherein said synthetic gene 
comprises an A+T base content of less than about 60%. 

6. The method of claim 4, wherein said method further 
comprises the step of determining the frequency of codon 
usage of said modified coding sequence. 

***** 
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TRANSGENIC PLANTS COMPRISING A 
SYNTHETIC INSECTICIDAL CRYSTAL 
PROTEIN GENE HAVING A MODIFIED 
FREQUENCY OF CODON USAGE 

CROSS REFERENCES TO RELATED 
APPLICATIONS 

This is a division of application Ser. No. 08/369,839, filed 
Jan. 6, 1995, now U.S. Pat. No. 5,567,862, which is a 
divisional of application Ser. No. 08/057,191, filed May 3, 
1993 now U.S. Pat. No. 5,380,831; which is a continuation 
of application Ser. No. 07/827,844, filed Jan. 28, 1992, now 
abandoned; which is a continuation of application Ser. No. 
07/242,482, filed Sep. 9, 1988, now abandoned, all of which 
are incorporated herein by reference. 

FIELD OF THE INVENTION 

This invention relates to the field of bacterial molecular 
biology and, in particular, to genetic engineering by recom- 
binant technology for the purpose of protecting plants from 
insect pests. Disclosed herein are the chemical synthesis of 
a modified crystal protein gene from Bacillus thuringiensis 
var. tenebrionis (Btt), and the selective expression of this 
synthetic insecticidal gene. Also disclosed is the transfer of 25 
the cloned synthetic gene into a host microoorganism, 
rendering the organism capable of producing, at improved 
levels of expression, a protein having toxicity to insects. 
This invention facilitates the genetic engineering of bacteria 
and plants to attain desired expression levels of novel toxins 30 
having agronomic value. 

BACKGROUND OF THE INVENTION 

B. thuringiensis (Bt) is unique in its ability to produce, 
during the process of sporulation, proteinaceous, crystalline 
inclusions which are found to be highly toxic to several 
insect pests of agricultural importance. The crystal proteins 
of different Bt strains have a rather narrow host range and 
hence are used commercially as very selective biological ^ 
insecticides. Numerous strains of Bt are toxic to lepi- 
dopteran and dipteran insects. Recently two subspecies (or 
varieties) of Bt have been reported to be pathogenic to 
coleopteran insects: var. tenebrionis (Krieg et al. (1983) Z. 
Angew. Entomol. 96:500-508) and var. san diego 
(Herrnstadt et al. (1986) Biotechnol. 4:305-308). Both 
strains produce flat, rectangular crystal inclusions and have 
a major crystal component of 64-68 kDa (Herrnstadt et al. 
supra; Bernhard (1986) FEMS Microbiol. Lett. 
33:261-265). 

Toxin genes from several subspecies of Bt have been 
cloned and the recombinant clones were found to be toxic to 
lepidopteran and dipteran insect larvae. The two 
coleopteran-active toxin genes have also been isolated and 
expressed. Herrnstadt et al. supra cloned a 5.8 kb BamHI 
fragment of Bt var. san diego DNA. The protein expressed 
in E. colt was toxic to P. luteola (Elm leaf beetle) and had 
a molecular weight of approximately 83 kDa. This 83 kDa 
toxin product from the var. san diego gene was larger than 
the 64 kDa crystal protein isolated from Bt var. sari diego 
cells, suggesting that the Bt var. san diego crystal protein 
may be synthesized as a larger precursor molecule that is 
processed by Bt var. san diego but not by E. coli prior to 
being formed into a crystal. 

Sekar et al. (1987) Proc. Nat. Acad. Sci. USA 65 
84:7036-7040; U.S. patent application Ser. No. 108,285, 
filed Oct. 13, 1987 isolated the crystal protein gene from Btt 
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and determined the nucleotide sequence. This crystal protein 
gene was contained on a 5.9 kb BamHI fragment 
(pNSBF544). A subclone containing the 3 kb Hindlll frag- 
ment from pNSBF544 was constructed. This HindlH frag- 
5 ment contains an open reading frame (ORF) that encodes a 
644-amino acid polypeptide of approximately 73 kDa. 
Extracts of both subclones exhibited toxicity to larvae of 
Colorado potato beetle (Leptinotarsa decemlineata, a 
coleopteran insect). 73- and 65-kDa peptides that cross- 
10 reacted with an antiserum against the crystal protein of var. 
tenebrionis were produced on expression in E. coli. Sporu- 
lating var. tenebrionis cells contain an imraunoreactive 
73-kDa peptide that corresponds to the expected product 
from the ORF of pNSBP544. However, isolated crystals 
15 primarily contain a 65-kDa component. When the crystal 
protein gene was shortened at the N-terminal region, the 
dominant protein product obtained was the 65-kDa peptide. 
A deletion derivative, p544Pst-Met5, was enzymatically 
derived from the 5.9 kb BamHI fragment upon removal of 
20 forty-six amino acid residues from the N-terminus. Expres- 
sion of the N-terrainal deletion derivative, p544Pst-Met5, 
resulted in the production of, almost exclusively, the 65 kDa 
protein. Recently, McPherson et al. (1988) Biotechnology 
6:61-66 demonstrated that the Btt gene contains two func- 
tional translation al initiation codons in the same reading 
frame leading to the production of both the full-length 
protein and an N-terminal truncated form. 

Chimeric toxin genes from several strains of Bt have been 
expressed in plants. Four modified Bt2 genes from var. 
berliner 1715, under the control of the 2' promoter of the 
Agrobacterium TR-DNA, were transferred into tobacco 
plants (Vaeck et ai. (1987) Nature 328:33-37). Insecticidal 
levels of toxin were produced when truncated genes were 
expressed in transgenic plants. However, the steady state 
mRNA levels in the transgenic plants were so low that they 
could not be reliably detected in Northern blot analysis and 
hence were quantified using ribonuclease protection experi- 
ments. Bt mRNA levels in plants producing the highest level 
of protein corresponded to -0.0001% of the poly(A)* 
mRNA. 

In the report by Vaeck et al. (1987) supra, expression of 
chimeric genes containing the entire coding sequence of Bt2 
were compared to those containing truncated Bt2 genes. 
Additionally, some T-DNA constructs included a chimeric 
45 NPTII gene as a marker selectable in plants, whereas other 
constructs carried translational fusions between fragments 
of 5/2 and the NPTII gene. Insecticidal levels of toxin were 
produced when truncated Btl genes or fusion constructs 
were expressed in transgenic plants. Greenhouse grown 
so plants produced -0.02% of the total soluble protein as the 
toxin, or 3 //g of toxin per g. fresh leaf tissue and, even at 
five-fold lower levels, showed 100% mortality in six-<iay 
feeding assays. However, no significant insecticidal activity 
could be obtained using the intact Bt2 coding sequence, 
55 despite the fact that the same promoter was used to direct its 
expression. Intact Bt2 protein and RNA yields in the trans- 
genic plant leaves were 10-50 times lower than those for the 
truncated Bt2 polypeptides or fusion proteins. 

Barton et al. (1987) Plant Physiol. 85:1103-1109 showed 
60 expression of a Bt protein in a system containing a 35S 
promoter, a viral (TMV) leader sequence, the Bt HD-1 4.5 
kb gene (encoding a 645 amino acid protein followed by two 
proline residues) and a nopaline synthase (cos) poly(A)+ 
sequence. Under these conditions expression was observed 
for Bt mRNA at levels up to 47 pg/20 //g RNAand 12 ng/mg 
plant protein. This amount of Bt protein in plant tissue 
produced 100% mortality in two days. This level of expres- 
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sion still represents a low level of mRNA (2.5xl0~ 4 %) and native Btt structural gene is modified in order to contain 

protein (1.2x10" %). codons preferred by highly expressed plant genes, to attain 

Various hybrid proteins consisting of N-terminal frag- an A+T content in nucleotide base composition substantially 

ments of increasing length of the Btl protein fused to NPT1I that found in plants, and also preferably to form a plant 

V a E ' C ° h by H ° fte Ct aL W FEBS Utt - 5 iniliation to eliminate sequences that cause 

a Vnn ^ comaimD g the first 607 amino destabilization, inappropriate polyadenyhtion, degradation 

acids of BP I exhibited insect toxicity; fusion proteins noi and termination of RNA and io avoid sequences that con- 

containing this mmimum N-terminal fragment were non- stitute secondary structure hairpins and RNA splice sites. In 

toxic. Appearance of WTO activity was not dependent upon tne synthetic genes, codons used to specify a given amino 

the presence of insechcidal activity; however, the confer- 10 acid are selected with regard to the distribution frequency of 

mationof the ^2poIypept,de appeared toexert an important codon magc cml d in m - hl ^ ^ £ 

n^ CD n CC Th° n H T^Tl ° fi ^ ^ S ^ mat amino acid ' ** Appreciated by those skilled in 

protein. Tliis sTudy did suggest that the global 3-D structure the art, the distribution frequency of codon usage utilized in 

of the Btl polypeptide is disturbed in truncated polypep- the synthetic gene is a determinant of the level of expression. 
1 c 15 Hence, the synthetic gene is designed such that its distribu- 

A number of researchers have attempted to express plant tion frequency of codon usage deviates, preferably no more 

genes in yeast (Neill et al (1987) Gene 55:303-317; Roth- than 25% from that of highly expressed plant genes and 

™1™ ^-^P^^^^l^^&o^ai.ime) more preferably, no more than about 10%. In addition' 

EMBO J. 5:45SM65) and E. coli (Fuzakawa et al. (1987) consideration is given to the percentage G+C content of the 

c^L^"-. 22 i :125 ~ 127; Vi6S et aL (1986) EMB0 J ' 20 degenerate third base (monocotyledons appear to favor G+C 
5:2439-2444; Gatenby et al. (1987) Eur. J. Biochem. in this position, whereas dicotyledons do not). It is also 

v,ni^ T lD the CaSC ° f Wheat a "g liadin ( Nei11 et ^ recognized that the XCG nucleotide is the least preferred 

(1987) supra); a-amylase (Rothstein et al. (1987) supra) codon in dicots whereas the XTA codon is avoided in both 

genes, and maize zein genes (Coraggio et al. (1986) supra) monocots and dicots. The synthetic genes of this invention 
in yeast, low levels of expression have been reported. Neill 25 also preferably have CG and TA doublet avoidance indices 

et al. have suggested that the low levels of expression of as defined in the Detailed Description closely approximating 

a-gliadin m yeast may be, due in part to codon usage bias, those of the chosen host plant. More preferably these indices 

since ct-ghadin codons for Phe, Leu, Ser, Gly, Tyr and deviate from that of the host by no more than about 10-15% 

X C ™/pm a "? T^rt WCl1 With th K - yCaSt of the Bt S enc of this Mention is performed 

^acceptor tRN^In^. coh however, soybean ^glycinin A2 30 using standard technology known to the art. The Btt struc- 

(Fuzakawa ^et al. (1987) supra) and wheat RuBPC SSU (Vies tura i gene designed for ~ hanced expressicm m lants of the 

et al. (1986) supra; Gatenby et al. (1987) supra) are spcc ific embodiment is enzymatically assembled within a 

expressed adequately. DNA vector from chemically synthesized oligonucleotide 

Not much is known about the makeup of tRNA popula- duplex segments. The synthetic Bt gene is then introduced 

tions m plants. Viotti et al. (1978) Biochim. Biophys. Acta 35 into a plant host cell and expressed by means known to the 

517:125-132 report that maize endosperm actively synthe- art. The insecticidal protein produced upon expression of the 

sizing zein, a storage protein rich in glutamine, leucine, and synthetic Bt gene in plants is functionally equivalent to a 

alanine, is characterized by higher levels of accepting activ- native Bt crystal protein in having toxicity to the same 

lty for these three amino acids than are maize embryo insects. 
tRNAs. This may indicate that the tRNA population of 40 

specific plant tissues may be adapted for optimum transla- BRIEF DESCRIPTION OF THE FIGURES 

tion of highly expressed proteins such as zein. To our FIG. 1 presents the nucleotide sequence for the synthetic 

knowledge, no one has experimentally altered codon bias in gene. Where different, the native sequence as found in 

highly expressed plant genes to determine possible effects of p544Pst-Met5 is shown above. Changes in amino acids 

the protein translation in plants to check the effects on the 45 (underlined) occur in the synthetic sequence with alanine 

level of expression. replacing threonine at residue 2 and leucine replacing the 

SUMMARY OF THE INVENTION St ° P at residue 596 followed by the addition of 13-amino 

acids at the C-terminus. 

It is the overall object of the present inventioD to provide FIG. 2 represents a simplified scheme used in the con- 

a means for plant protection against insect damage. The 50 struction of the synthetic/?// gene. Segments A through M 

invention disclosed herein comprises a chemically synthe- represent oligonucleotide pieces annealed and ligated 

sized gene encoding an insecticidal protein which is func- together to form DNA duplexes having unique splice sites to 

tionally equrvalent to a nat.ve insechcidal protein of Bt. This aUow spec i fic enzymatic assembly of the DNAsegments to 

synthetic gene is designed to be expressed in plants al a level gj ve the desired gene. 

higher than a native Bt gene. It is preferred that the synthetic « mr \ u » „„u„ . •„ a- u .l. i_, r 

gene be designed to be highly expressed in plants as defined , > 1S ' Schema,lc d,a S? m show,n 8 ,he r assemblv of 

herein PrefTr»hl„ .he JL,i . • v la p , , 35 Gennea oligonucleotide segments in the construction of a synthetic 

mTielv J?2Si T 8enC $ ?* leSSt appn>5a ; B " & ene - E«* segment (A through M) is built from oligo- 
mately 85% homologous to an insecticidal protein gene of nucleotides of d £ Knt ^ and ligated , 0 ^ 

' the desired DNA segment. 

It is a particular object of this invention to provide a 60 
synthetic structural gene coding for an insecticidal protein BRIEF DESCRIPTION OF THE SEQUENCES 

from Btt having, for example, the nucleotide sequences SEQ ID NO. 1 is the native DNA gene sequence corre- 
presented in FIG. 1 and spanning nucleotides 1 through 1793 sponding to FIG. 1. 

or spanning nucleotide 1 through 1833 with functional SEQ ID NO. 2 is the corresponding amino acid sequence 

equivalence. 65 encoded by SEQ ID NO. 1. 

In designing synthetic Btt genes of this invention for SEQ ID NO. 3 is the synthetic DNA; gene sequence 
enhanced expression in plants, the DNA sequence of the corresponding to FIG. 1. j 



DETAILED DESCRIPTION OF THE 
INVENTION 
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SEQ ID NO. 4 is the corresponding amino acid sequence initiation of transcription. The structural gene may be one 
encoded by SEQ ID NO. 3. which is normally found in the cell or one which is not 

SEQ ID NO. 5 is the sequence of a synthetic DN A linker normally found in the cellular location wherein it is 
described in Example l(i). introduced, in which case it is termed a heterologous gene. 

SEQ ID NO. 6 is the sequence of the 5' plant consensus 5 A hcter ologous gene may be derived in whole or in part from 
splice site found in Example l(iii)(d). an Y source know to the art, including a bacterial genome or 

SEO ID NO. 7 is the sequence of the 3' plant consensus t £f? me * ^ a nuclear or plasmid DNA, cDNA, viral 
splice site found in Example 1 (iiiXd) UN A or chcmicall y synthesized DNA. A structural gene nay 

c Pn In Nn R • t . „„„ w - A r e A contain one or more modifications in either the coding or the 

found in Table 4 DUcleotlde se ^ ence of Segment A 10 unlranslated regions which could affect tne biolo ^ activ . 

* il y °r the chemical structure of the expression product, the 

SLQ ID NO. 9 is the nucleotide sequence of Segment M rate of expression or the manner of expression control. Such 
found in Table 5. modifications include,, but .are not limited to, mutations, 

insertions, deletions and substitutions of one or more nucle- 
15 otides. The structural gene may constitute an uninterrupted 
coding sequence or it may include one or more introns, 
The following definitions are provided in order to provide bounded by the appropriate, splice junctions. The structural 
clarity as to the intent or scope of their usage in the S ene ma y be a composite of segments , derived from a 
Specification and claims. plurality of sources, naturally occurring or synthetic. The 

Expression refers to the transcription and translation of a 20 structural S ene ma y aiso en H e a fusion P rotein - 
structural gene to yield the encoded protein. The synthetic Bt Synthetic gene refers to a DNA sequence of a structural 
genes of the present invention are designed to be expressed S ene that is chemically synthesized in its entirety or for the 
at a higher level in plants than the corresponding native Bt greater part of the coding region. As exemplified herein, 
genes. As will be appreciated by those skilled in the art, oligonucleotide building blocks are synthesized using pro- 
structural gene expression levels are affected by the regu- 25 cedures known to those skilled in the art and are ligated and 
la lory DNA sequences (promoter, polyadenylation sites, annealed to form gene segments which are then enzymati- 
enhancers, etc.) employed and by the host cell in which the call y assembled to construct the entire gene. As is recog- 
structural gene is expressed. Comparisons o£ synthetic Bt niz ^ b y tnose sk iUed in the art, functionally and structurally 
gene expression and native.fi/ gene expression must be made equivalent genes to the synthetic genes described herein may 
employing analogous regulatory sequences and in the same 30 be prepared by site-specific mutagenesis or other related 
host cell. It will also be apparent that analogous means of methods used in the art. 

assessing gene expression must be employed in such com- Transforming refers to stably introducing a DNA segment 
parisons. carrying a functional gene into an organism that did not 

Promoter refers to the nucleotide sequences at the 5' end ^ previously contain that gene, 
of a structural gene which direct the initiation of transcrip- * Plant tissue includes differentiated and undifferentiated 
tion. Promoter sequences are necessary, but not always tissues of plants, including but not limited to, roots, shoots, 
sufficient, to drive the expression of a downstream gene. In leaves, pollen, seeds, tumor tissue and various forms of cells 
prokaryotes, the promoter drives transcription by providing in culture,- such as single cells, protoplasts, embryos and 
binding sites to RNA polymerases and other initiation and ^ callus tissue. The plant tissue may be in planta or in organ, 
activation factors. Usually promoters drive transcription tissue or cell culture. 

preferentially in the downstream direction, although promo- Plant cell as used herein includes plant cells in planta and 

tional activity can be demonstrated (at a reduced level of plant cells and protoplasts in culture 
expression) when the gene is placed upstream of the pro- Homology refers to identity or near identity of nucleotide 
™ Th tra r nPtl ° n K tQ Z uiat ?*y P™ 01 " 45 or amino acid sequences. As is understood in the art, 
sequences Thus in the construction of heterologous nucleotide mismatches can occur at the third or wobble base 
promoter/stnicturai gene combinations, the structural gene is in the codon without causing amino acid substitutions in the 

• 'fTh ^ 3 ?iT°K er ^ th3t fiDal P ol yP c P^e sequence. Also,, minor nucleotide modi- 
the expression of the gene is controlled by promoter fications (e.g., substitutions, insertions or deletions) in cer- 
sequences The promoter is positioned preferentially 50 tain regions of the gene sequence can be tolerated and 
upstream to the structural gene and at a distance from the considered insignificant whenever such modifications result 
hTJ^ P h! D Slan , h * approximates , the distanct ; in changes in amino acid sequence that do not alter func- 
be ween the promoter and the gene it controls in its natural tionality of the final product It has been shown that chemi- 
se* mg. As is known in the art some variation in this caily synthesized copies of whole, or parts of, gene 
distance can be tolerated without loss of promoter function. 55 seque nces can replace the corresponding regions in the 

A gene refers to the entire DNA portion involved in the natural gene without loss of gene function. Homologs of 

synthesis of a protein. A gene embodies the structural or specific DNA sequences may be identified by those skilled 

coding portion which begins at the 5' end from the transla- in the art using the test of cross-hybridization of nucleic 

tional start codon (usually ATG) and extends to the stop acids under conditions of stringency as is well understood in 

(TAG, TGA or TAA) codon at the 3' end. It also contains a ^ the art (as described in Hames and Higgens (eds.) (1985) 

promoter region, usually located 5' or upstream to the Nucleic Acid Hybridization, I RL Press, Oxford, UK). Extent 

structural gene, which initiates and regulates the expression of homology is often measured in terms of percentage of 

of a structural gene. Also included in a gene are the 3' end identity between the sequences compared 

and polyfAH addition sequences. Functionally equivalent refers to identity or near identity 

Structural gene is that portion of a gene comprising a 65 of function. A synthetic gene product which is toxic to at 

DNA segment encoding a protein, polypeptide or a portion least one of the same insect species as a natural Bt protein 

thereof, and excluding the 5' sequence which drives the is considered functionally equivalent thereto. As exemplified 
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herein, both natural and synthetic Bit genes encode 65 kDa cal or biological manipulation (including but not limited to 

inseclicidal proteins having essentially identical amino acid substitution, addition, insertion, deletion extraction 

sequences and having toxicity lo coleopteran insects. The isolation, mutation and replication) of the original source * 

synthetic Bl genes of the present invention are not consid- Chemically synthesized, as related to a sequence of DNA,- 

ered to be functionally equivalent to native Bl genes, since 5 m6ans , hat tn6 com t nudeotides ^ e asscrablcd m 

they arc expressible at a h.gher level ,n plants than native B, vilro . Manua] syn , hesjs of DNA may ^ accom . 

g I? ' r c a a r L r plished using well established procedures (Caruthers, M. 

iTh?H P fi C f^ T referS ,0 ,- ,he f ref !, r " < 1983 > ia Methodology of DNA andRNA Sequencing, W iss- 

ence exhibited by a specfic host cell in usage of nucleotide ma „ (ed } p rae gbUshcn, New YorU 9 ch apte f n } , or 

colons to specify a given ammo ac,^ To determine the 10 automated chemLl synthesis can be performed Lng one of 
frequency of usage of a particular codon id a gene, the u r • « -i . , . . ^"h>^ 1 ^^ 

nulerofc^rr^sofLccKloninthegeneLivided * "T^™ dTZ^ Th, k „„ • 

by the total number of occurrences of all colons specifying J^^ST* ,0 ^ ^"T* " ^ T 

the same amino acid in the gene. Table 1, for example, give! ^S^^T^A, * ^1 *T a™ £ 

f r p ffl ,on.„ „„a~~ f« d/ u* l amount oi its specinc mRNA transcripts produced is suffi- 

S.Sn?JSJT w f neS ' ^ 15 be quanUfied in Northern blots and, thus, represents 

obtained by analysts of fou Bl genes whose sequences are a Ievel of spedfic m R N Aexpressed corresponding to greater 
pubkcly availably S.muarly the requency o preferred lhan or P ual t0 approxin * ely 0 .001% of the pofy(A) + 

l^Z^L V a f hos, H oeU J :an calcula,e , d ^ mRNA. To date, nafcral A genes are transcribed J?, toil 

averaging frequency of preferred codon usage in a large wberein |he amoun , o{ ^ RN ^ 

number of genes expressed by the host cell. It is preferable ,„ „■ , ,„.„,• , . f, F . , / . 

that this analysis be limited to genes that are highly 20 Howell ,h T* No , r,hern . blo, ,ec F hm 1 ue - 

expressed by the host cell. Table 1, for example, give S 8 «he ^^,'1^^ invention, ranscnption of a syn- 
frequency -of «*j usage by highly expresse/genLxhib- l^iLSn^ 

ited by dicotyledonous plants, and monocotyledonous a»^a C,, „w u ■ u j * 
«i««t r tk. jvL. i * || m , duced but also results in enhanced expression of the trans- 
plants. The dicot codon usage was calculated using 154 „ 1a< - nrrt/1ll/ , t . • . . j • • j 7. 

„ t ■ & u . ^ L , 25 lation product which is measured in insecticidal bioassays. 

highly expressed coding sequences obtained from Genbank rw»<»i nmt ~i -a i * i * • UIuaaoo {°; 

.rl i;ot<,^ „ xfui 1 # a Crystal protein or insecticidal crystal protein or crystal 

which are listed in Table 1. Monocot codon usage was t/w - r ^t ar f tntUa • * • ; c , 

calculated using 53 monoco, nuclear gene coding sequences l°™ a ^ZTTk'T °, 
obtained from Genbank and listed in Table 1 located in f T m S 'T ° f * ' component 

Example 1 exhibits selective pathogenicity to different species of 

„„ p * . . , 30 insects. The molecular size of the major protein isolated 

When synthesizing a gene for improved expression in a from pa rasporal crystals varies depending on the strain of Bt 
host cell it is desirable to design the gene such that its frorn whicn it is derived c , ins hayi molecular 
frequency of codon usage approaches the frequency of weighls of approximately 132, 65, and 28 kDa have been 
preferred codon usage of the host cell. reported „ has been showQ tQat fte approximately 132 kDa 

Tlie percent deviaUon of the frequency of preferred codon 35 protein is a protoxin that is cleaved to form an approximately 
usage for a synthetic gene from that employed by a host cell 65 kDa toxin. 

is calculated first by determining the percent deviation of the The crystal protein gene refers to the DNA sequence 
frequency of usage of a single codon from that of the host encoding the insecticidal crystal protein in either full length 
cell followed by obtaining the average deviation over all protoxin or toxin form, depending on the strain of Bt from 
codons. As defined herein this calculation includes unique 40 which the gene is derived. 

codons (i.e., ATG and TGG). The frequency of preferred The authors of this invention observed that expression in 

codon usage of the synthetic Bit gene, whose sequence is plants of Bt crystal protein mRNA occurs at levels that are 
given in FIG. 1, is given in Table 1. The frequency of not routinely detectable in Northern blots and that low levels 
preferred usage of the codon GTA' for valine in the syn- G f Bt crystal protein expression correspond to this low level 
thetic gene (0.10) deviates from that preferred by dicots 45 of mRNA expression. It is preferred for exploitation of these 
(0.12) by 0.02/0.12=0.167 or 16.7%. The average deviation genes as potential biocontrol methods that the level of 
over all amino acid codons of the Btt synthetic gene codon expression of Bt genes in plant cells be improved and that 
usage from that of dicot plants is 7.8%. In general terms the the stability of Bt mRNA in plants be optimized. This will 
overall average deviation of the codon usage of a synthetic allow greater levels of Bt mRNA to accumulate and will 
gene from that of a host cell is calculated using the equation 50 result in an increase in the amount of insecticidal protein in 

plant tissues. This is essential for the control of insects that 
Xn ~ >n K 100 are relatively resistant to Bt protein. 

x " Thus, this invention is based on the recognition that 

z expression levels of desired, recombinant insecticidal pro- 

55 tein in transgenic plants can be improved via increased 
expression of stabilized mRNA transcripts; and that, 
where X,«frequency of usage for codon n in the host cell; conversely, detection of these stabilized RNA transcripts 
Y„<=frequency of usage for codon n in the synthetic gene. may be utilized to measure expression of translational 
Where n represents an individual codon that specifies an product (protein). .This invention provides a means of resolv- 
amino acid, the total number of codons is Z, which in the 60 ing the problem of low expression of insecticidal protein 
preferred embodiment is 61. The overall deviation of the RNA in plants and, therefore, of low protein expression 
frequency of codon usage, A, for all amino acids should through the use of an improved, synthetic gene specifying an 
preferably be less than about 25%, and more preferably less insecticidal crystal protein from Bt. 
than about 10%. Attempts to improve the levels of expression of Bt genes 

Derived from is used to mean taken, obtained, received, 65 in plants have centered on comparative studies evaluating 
traced, replicated or descended from a source (chemical parameters such as gene type, gene length, choice of 
and/or biological). A derivative may be produced by chemi- promoters, addition of plant viral untranslated RNA leader, 
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addition of intron sequence and modification of nucleotides highly expressed genes enhances translation and is required 

surrounding the initiation ATG codon. To date, changes in for maintaining mRNA stahility in yeast. Without doubt, the 

these parameters have not led to significant enhancement of degree of codon bias is an important factor to consider when 

Bt protein expression in plants. Applicants find that, engineering high expression of heterologous genes in yeast 

surprisingly, to express Bt proteins at the desired level in 5 and other systems. 

plants, modifications in the coding region of the gene were Experimental evidence obtained from point mutations and 

effective. Structural-function relationships can be studied deletion analysis has indicated that in eukaryotic genes 

using site-specific mutagenesis by replacement of restriction specific sequences are associated with post -transcriptional 

fragments with synthetic DNA duplexes containing the processing, RNA destabilization, translational termination, 
desired nucleotide changes (Lo et al. (1984) Proc. Natl, ao intron splicing and the like. These are preferably employed 

Acad. Sci. 81:2285-2289). However, recent advances in in the synthetic genes of this invention. In designing a 

recombinant DNA technology now make it feasible to bacterial gene for expression in plants, sequences which 

chemically synthesize an entire gene designed specifically interfere with the efficacy of gene expression are eliminated, 

for a desired function. Thus, the Btt coding region was In designing a synthetic gene, modifications in nucleotide 
chemically synthesized, modified in such a way as to 15 sequence of the coding region are made to modify the A+T 

improve its expression in plants. Also, gene synthesis pro- content in DNA base composition of the synthetic gene to 

vides the opportunity to design the gene so as to facilitate its reflect that normally found in genes for highly expressed 

subsequent mutagenesis by incorporating a number of proteins native to the host cell. Preferably the A+T content 

appropriately positioned restriction endonuclease sites into of the synthetic gene is substantially equal to that of said 
the gene. 20 genes for highly expressed proteins. In genes encoding 

The present invention provides a synthetic Bt gene for a highly expressed plant proteins, the A+T content is approxi- 

crystal protein toxic to an insect. As exemplified herein, this mately 55%. It is preferred that the synthetic gene have an 

protein is toxic to coleopteran insects. To the end of improv- A+T content near this value, and not sufficiently high as to 

ing expression of this insecticidal protein in plants, this cause destabilization of RNA and, therefore, lower the 

invention provides a DNA segment homologous to a Btt 25 protein expression levels. More preferably, the A+T content 

structural gene and, as exemplified herein, having approxi- is no more than about 60% and most preferably is about 

mately 85% homology to the Btt structural gene in p544Pst- 55%. Also, for ultimate expression in plants, the synthetic 

Met5. In this embodiment the structural gene encoding a Btt gene nucleotide sequence is preferably modified to form a 

insecticidal protein is obtained through chemical synthesis plant initiation sequence at the 5' end of the coding region, 

of the coding region. A chemically synthesized gene is used 30 In addition, particular attention is preferably given to assure 

in this embodiment because it best allows for easy and that unique restriction sites are placed in strategic positions 

efficacious accommodation of modifications in nucleotide to allow efficient assembly of oligonucleotide segments 

sequences required to achieve improved levels of cross- during construction of the synthetic gene and to facilitate 

expression. subsequent nucleotide modification. As a result of these 

Today, in general, chemical synthesis is a preferred 35 modifications in coding region of the native Bt gene, the 

method to obtain a desired modified gene. However, to date, preferred synthetic gene is expressed in plants at an 

no plant protein gene has been chemically synthesized nor enhanced level when compared to that observed with natural 

has any synthetic gene for a bacterial protein been expressed Bt structural genes. 

in plants. In this invention, the approach adopted for syn- In specific embodiments, the synthetic Bt gene of this 

thesizing the gene consists of designing an improved nucle- 40 invention encodes a Btt protein toxic to coleopteran insects, 

otide sequence for the coding region and assembling the Preferably, the toxic polypeptide is about 598 amino acids in 

gene from chemically synthesized oligonucleotide seg- length, is al least 75% homologous to a Btt polypeptide, and, 

ments. In designing the gene, the coding region of the as exemplified herein, is essentially identical to the protein 

naturally-occurring gene, preferably from the Btt subclone, encoded by p544Pst-Met5, except for replacement of threo- 

p544Pst-Met5, encoding a 65 kDa. polypeptide having 45 nine by alanine at residue 2. This amino acid substitution 

coleoperan toxicity, is scanned for possible modifications results as a consequence of the necessity to introduce a 

which would result in improved expression of the synthetic guanine base at position +4 in the coding sequence, 

gene in plants. For example, to optimize the efficiency of In designing the synthetic gene of this invention, the 

translation, codons preferred in highly expressed proteins of coding region from the Btt subclone, p544Pst-Met5, encod- 

the host cell are utilized. 50 ing a 65 kDa polypeptide having coleopteran toxicity, is 

Bias in codon choice within genes in a single species scanned for possible modifications which would result in 

appears related to the level of expression of the protein improved expression of the synthetic gene in plants. For 

encoded by that gene. Codon bias is most extreme in highly example, in preferred embodiments, the synthetic insecti- 

expressed proteins of £. coli and yeast. In these organisms, cidal protein is strongly expressed in dicot plants, e.g., 

a strong positive correlation has been reported between the 55 tobacco, tomato, cotton, etc., and hence, a synthetic gene 

abundance of an isoaccepting tRNA species and the favored under these conditions is designed to incorporate to advan- 

synonyraous codon. In one group of highly expressed pro- tage codons used preferentially by highly expressed dicot 

teins in yeast, over 96% of the amino acids are encoded by proteins. In embodiments where enhanced expression of 

only. 25 of the 61 available codons (Bennetzen and Hall insecticidal protein is desired in a monocot, codons pre- 

(1982) J. Biol. Chem. 257:3026-3031). These 25 codons are 60 ferred by highly expressed monocot proteins (given in Table 

preferred in all sequenced yeast genes, but the degree of 1) are employed in designing the synthetic gene, 

preference varies with the level of expression of the genes. In general, genes within a taxonomic group exhibit simi- 

Recently, Hoekema and colleagues (1987) Mol. Cell. Biol. larities in codon choice, regardless of the function of these 

7:2914-2924 reported that replacement of these 25 preferred genes. Thus an estimate of the overall use of the genetic code 

codons by minor codons in the 5* end of the highly expressed 65 by a taxonomic group can be obtained by summing codon 

yeast gene PGK1 results in a decreased level of both protein frequencies of all its sequenced genes. This species-specific 

and mRNA. They concluded that biased codon choice in codon choice is reported in this invention from analysis of 



6,013,523 



11 



208 plant genes. Both raonocot and dicot plants are analyzed 
individually to determine whether these broader taxonomic 
groups are characterized by different patterns of synony- 
mous codon preference. The 208 plant genes included in the 
codon analysis code for proteins having a wide range of 
functions and they represent 6 monocot and 36 dicot species. 
These proteins are present in different plant tissues at 
varying levels of expression. 

In this invention it is shown that the relative use of 
synonymous codons differs between the monocots and the 
dicots. In general, the most important factor in discriminat- 
ing between monocot and dicot patterns of codon usage is 
the percentage G+C content of the degenerate third base. In 
monocots, 16 of 18 amino acids favor G+C in this position, 
while dicots only favor G+C in 7 of 18 amino acids. 

The G ending codons for Thr, Pro, Ala and Ser are 
avoided in both monocots and dicots because they contain C 
in codon position II. The CG dinucleotide is strongly 
avoided in plants (Boudraa (1987) Genet. Sel. Evol. 
19:143-154) and other eukaryotes (Grantham et al. (1985) 
Bull. Inst. Pasteur 83:95-148), possibly due to regulation 
involving methylation. In dicots, XCG is always the least 
favored codon, while in monocots this is not the case. The 
doublet TA is also avoided in codon positions II and III in 
most eukaryotes, and this is true of both monocots and 
dicots. 

Grantham and colleagues (1986) Oxford Surveys in Evol. 
Biol. 3:48-81 have developed two codon choice indices to 
quantify CG and TA doublet avoidance in codon positions II 
and III. XCG/XCC is the ratio of codons having C as base 
II of G-ending to C-ending triplets, while XTA/XTT is the 
ratio of A-ending to T-ending triplets with T as the second 
base. These indices have been calculated for the plant data 
in this paper (Table 2) and support the conclusion that 
monocot and dicot species differ in their use of these 
dioucleotides. 

TABLE 2 

Avoidance of CG and TA doublets in codons position II-III. 
XCG/XCC and XTA/XAA values arc multiplied by 100. 



12 



30 



15 



25 



30 



35 



40 



Group 



Plants 



Di- 
cots 



Mono- 
cots 



Maize 



Soy- 
bean 



RuBPC 
SSU 



CAB 



XCG/XCC 
XTA/XTT 



40 

37 



30 
35 



61 
47 



67 
43 



37 
43 



38 
9 



22 
13 



RuBPC SSU = ribulose 3,5 bisphosphate small subunit 
CAB = chlorophyll a/b binding protein 

Additionally, for two species, soybean and maize, 
species-specific codon usage profiles were calculated (not 
shown). The maize codon usage pattern resembles that of 
monocots in general, since these sequences represent over 
half of the monocot sequences available. The codon profile 
of the maize subsamplc is even more strikingly biased in its 
preference for G+C in codon position III. On the other hand, 
the soybean codon usage pattern is almost identical to the 
general dicot pattern, even though it represents a much 
smaller portion of the entire dicot sample. 

In order to delermine whether the coding strategy of 
highly expressed genes such as the ribulose 1,5 bisphosphate 
small subunit (RuBPC SSU) and chlorophyll a/b binding 
protein (CAB) is more biased thao that of plant genes in 
general, codon usage profiles for subsets of these genes (19 
and 17 sequences, respectively) were calculated (not 
shown). The RuBPC SSU and CAB pooled samples are 
characterized by stronger avoidance of the codons XCG and 
XTA than in the larger monocot and dicot samples (Table 2). 



45 



50 



55 



60 
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Although most of the geneJ'in these subsamples are dicot in 
origin (17/19 and 15/17), their codon profile resembles that 
of the monocots in that G+C is utilized in the degenerate 
base III. 

The use of pooled data for highly expressed genes may 
obscure identification of species-specific patterns in codon 
choice. Therefore, the codon choices of individual genes for 
RuBPC SSU and CAB were tabulated. The preferred codons 
of the maize and wheat genes for RuBPC SSU and CAB arc 
more restricted in general than are those of the dicot species. 
This is in agreement with Matsuoka et al. (1987) J. Biochera. 
102:673-676) who noted the extreme codon bias of the 
maize RuBPC SSU gene as well as two other highly 
expressed genes in maize leaves, CAB and phospho- 
enolpyruvate carboxylase. These genes almost completely 
avoid the use of A+T in codon position III, although this 
codon bias was not as pronounced in non-leaf proteins such 
as alcohol dehydrogenase, zein 22 kDa sub-unit, sucrose 
synthetase and ATP/ADP translocator. Since the wheat SSU 
and CAB genes have a similar pattern of codon preference, 
this may reflect a common monocot pattern for these highly 
expressed genes in leaves. The CAB gene for Lemna and the 
RuBPC SSU genes for Chlamdomonas share a similar 
extreme preference for G+C in codon position III. In dicot 
CAB genes, however, A+T degenerate bases are preferred 
by some synonymous codons (e.g., GCT for Ala, CTT for 
Leu, GGA and GGT for Gly). In general, the G+C prefer- 
ence is less pronounced for both RuBPC SSU and CAB 
genes in dicots than in monocots. 

In designing a synthetic gene for expression in plants, 
attempts are also made to eliminate sequences which inter- 
fere with the efficacy of gene expression. Sequences such as 
the plant polyadenylalion signals, e.g., AATAAA, poly- 
merase II termination sequence, e.g., CAN (7 . 9) AGTNNAA, 
UCUUCGG hairpins and plant consensus splice sites are 
highlighted and, if present in the native Btt coding sequence, 
are modified so as to eliminate potentially deleterious 
sequences. 

Modifications in nucleotide sequence of the Btt coding 
region are also preferably made to reduce the A+T content 
in DNAbase composition. The Btt coding region has an A+T 
content of 64%, which is about 10% higher than that found 
in a typical plant coding region. Since A+T-rich regions 
typify plant intergenic regions and plant regulatory regions, 
it is deemed prudent to reduce the A+T content. The syn- 
thetic Btt gene is designed to have an A+T content of 55%, 
in keeping with values usually found in plants. 

Also, a single modification (to introduce guanine in lieu 
of adenine) at the fourth nucleotide position in theBtf coding 
sequence is made in the preferred embodiment to form a 
sequence consonant with that believed to function as a plant 
initiation sequence (Taylor et al. (1987) Mol. Gen. Genet. 
210:572-577) in optimization of expression. In addition, in 
exemplifying this invention thirty-nine nucleotides (thirteen 
codons) are added to the coding region of the synthetic gene 
in an attempt to stabilize primary transcripts. However, it 
appears that equally stable transcripts are obtained in the 
absence of this extension polypeptide containing thirty-nine 
nucleotides. 

Not all of the above-mentioned modifications of the 
natural Bt gene must be made in constructing a synthetic Bt 
gene in order to obtain enhanced expression. For example, 
a syntheiic gene may be synthesized for other purposes in 
addition to that of achieving enhanced levels of expression. 
Under these conditions, the original sequence of the natural 
Bt gene may be preserved within a region of DNA corre- 
sponding to one or more, but not all, segments used to 
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construct the synthetic gene. Depending on the desired 
purpose of the gene, modification may encompass substitu- 
tion of one or more, but not all, of the oligonucleotide 
segments used to construct the synthetic gene by a corre- 
sponding region of natural Bt sequence. 

As is known to those skilled in the art of synthesizing 
genes (Mandecki et al. (1985) Proc. Natl. Acad. Sci. 
82:3543-3547; Feretti et al. (1986) Proc. Natl. Acad. Sci. 
83:599-603), the DNA sequence to be synthesized is 
divided into segment lengths which can be synthesized 
conveniently and without undue complication. As exempli- 
fied herein, in preparing to synthesize the Btt gene, the 
coding region is divided into thirteen segments (A-M). Each 
segment has unique restriction sequences at the cohesive 
ends. Segment A, for example, is 228 base pairs in length 
and is constructed from six oligonucleotide sections, each 
containing approximately 75 bases. Single-stranded oligo- 
nucleotides are annealed and ligated to form DNA segments. 
The length of the protruding cohesive ends in complemen- 
tary oligonucleotide segments is four to five residues. In the 
strategy evolved for gene synthesis, the sites designed for 
the joining of oligonucleotide pieces and DNA segments are 
different from the restriction sites created in the gene. 

In the specific embodiment, each DNA segment is cloned 
into a pIC-20 vector for amplification of the DNA. The 
nucleotide sequence of each fragment is determined at this 
stage by the dideoxy method using the recombinant phage 
DNA as templates and selected synthetic oligonucleotides as 
primers. 

As exemplified herein and illustrated schematically in 
FIGS. 3 and 4, each segment individually (e.g., segment M) 
is excised at the flanking restriction sites from its cloning 
vector and spliced into the vector containing segment A. 
Most often, segments are added as a paired segment instead 
of as a single segment to increase efficiency. Thus, the entire 
gene is constructed in the original plasmid harboring seg- 
ment A. The nucleotide sequence of the entire gene is 
determined and found to correspond exactly to that shown in 
FIG. 1. 

In preferred embodiments the "synthetic Btt gene is 
expressed in plants at an enhanced level when compared to 
that observed with natural Btt structural genes. To that end, 
the synthetic structural gene is combined with a promoter 
functional in plants, the structural gene and the promoter 
region being in such position and orientation with respect to 
each other that the structural gene can be expressed in a cell 
in which the promoter region is active, thereby forming a 
functional gene. The promoter regions include, but are not 
limited to, bacterial and plant promoter regions. To express 
the promoter region/structural gene combination, the DNA 
segment carrying the combination is contained by a cell. 
Combinations which include plant promoter regions are 
contained by plant cells, which, in turn, may be contained by 
plants or seeds. Combinations which include bacterial pro- 
moter regions are contained by bacteria, e.g., Bt or E. coli. 
Those in the art will recognize that expression in types of 
micro-organisms other than bacteria may in some circum- 
stances be desirable and, given the present disclosure, fea- 
sible without undue experimentation. 

The recombinant DNA molecule carrying a synthetic 
structural gene under promoter control can be introduced 
into plant tissue by any means known to those skilled in the 
art. The technique used for a given plant species or specific 
type of plant tissue depends on the known successful tech- 
niques. As novel means are developed for the stable inser- 
tion of foreign genes into plant cells and for manipulating 
the modified cells, skilled artisans will be able to select from 
known means to achieve a desired result. Means for intro- 
ducing recombinant DNA into plant tissue include, but are 
not limited to, direct DNA uptake (Paszkowski, J. et al. 
(1984) EMBO J 3:2717), electroporation (Fromm, M. et al. 
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(1985) Proc. Natl. Acad. Sci. USA 82:5824), microinjection 
(Crossway, A. et al. (1986) Mol. Gen. Genet. 202:179), or 
T-DNA mediated transfer from Agrobacterium tumefaciens 
to the plant tissue. There appears to be no fundamental 
limitation of T-DNA transformation to the natural host range 
of Agrobacterium. Successful T-DNA-mediated transforma- 
tion of raonocots (Hooykaas-Van Slogteren, G. et al. (1984) 
Nature 311:763), gymnosperm (Dandekar, A. el al. (1987) 
Biotechnology 5:587) and algae (Ausich, R., EPO applica- 
tion 108,580) has been reported. Representative T-DNA 
vector systems are described in the following references: An, 
G. et al. (1985) EMBO J. 4:277; Herrera-Estrella, L. et al. 
(1983) Nature 303:209; Herrera-Estrella, L. et al. (1983) 
EMBO J. 2:987; Herrera-Estrella, L. et al. (1985) in Plant 
Genetic Engineerings New York: Cambridge University 
Press, p. 63. Once introduced into the plant tissue, the 
expression of the structural gene may be assayed by any 
means known to the art, and expression may be measured as 
mRNA transcribed or as protein synthesized. Techniques are 
known for the in vitro culture of plant tissue, and in a 
number of cases, for regeneration into whole plants. Proce- 
dures for transferring the introduced expression complex to 
commercially useful cultivars are known to those skilled in 
the art. 

In one of its preferred embodiments the invention dis- 
closed herein comprises expression in plant cells of a 
synthetic insecticidal structural gene under control of a plant 
expressible promoter, that is to say, by inserting the insec- 
ticide structural gene into T-DNA under control of a plant 
expressible promoter and introducing the T-DNA containing 
the insert into a plant cell using known means, once plant 
cells expressing a synthetic insecticidal structural gene 
under control of a plant expressible promoter are obtained, 
plant tissues and whole plants can be regenerated therefrom 
using methods and techniques well-known in the art. The 
regenerated plants are then reproduced by conventional 
means and the introduced genes can be transferred to other 
strains and cultivars by conventional plant breeding tech- 
niques. 

The introduction and expression of the synthetic structural 
gene for an insecticidal protein can be used to protect a crop 
from infestation with common insect pests. Other uses of the 
invention, exploiting the properties of other insecticide 
structural genes introduced into other plant species will be 
readily apparent to those skilled in the art. The invention in 
principle applies to. introduction of any synthetic insecticide 
structural gene into any plant species into which foreign 
DNA (in the preferred embodiment T-DNA) can be intro- 
duced and in which said DNA can remain stably replicated. 
In general, these taxa presently include, but are not limited 
to, gymnosperms and dicotyledonous plants, such as sun- 
flower (family Compositeae), tobacco (family Solanaceae), 
alfalfa, soybeans and other legumes (family Leguminoseae), 
cotton (family Malvaceae), and most vegetables, as well as 
monocotyledonous plants. A plant containing in its tissues 
increased levels of insecticidal protein will control less 
susceptible types of insect, thus providing advantage over 
present insecticidal uses of Bt. By incorporation of the 
insecticidal protein into the tissues of a plant, the present 
invention additionally provides advantage over present uses 
of insecticides by eliminating instances of nonuniform appli- 
cation and the costs of buying and applying insecticidal 
preparations to a field. Also, the present invention eliminates 
the need for careful timing of application of such prepara- 
tions since small larvae are most sensitive to insecticidal 
protein and the protein is always present, minimizing crop 
damage that would otherwise result from preapplication 
larval foraging. 

This invention combines the specific teachings of the 
present disclosure with a variety of techniques and expedi- 
ents known in the art. The choice of expedients depends on 
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variables such as the choice of insect icidal protein from a Bt patent application Ser. No. 108,285, filed Oct. 13, 1987, 

strain, the extent of modification in preferred codon usage, which is hereby incorporated by reference. A 3.0 kbp 

manipulation of sequences considered to be destabilizing to Hindlll fragment carrying the crystal protein gene of 

UNA or sequences prematurely terminating transcription, p NSBP544 is inserted into the HindlU site of P IC-20H 

insertions of restriction sites within the design of the syn- 5 (Marsn , j. L> ct al (19g4) Gcne ^2:481^8S) thereby 

thetic gene to allow future nucleotide modifications addi- ieldi a lasmid designated p5 44-HindIII, which is on 

lion of introns or enhancer sequences to the 5' and/or 3< ends d it Expression in E. coli yiekis a 73 kDa crystal protein 

of the synthetic structural gene the promoter region, the host in addition * ^ 65 RDa * charactcristic ^ f thc P crystal 

in which a promoter region/structural gene combination is • . . f D K . t wiui^i^iai 

expressed, and the like As novel insecticidal proteins and pr0lem ° btained from Btt lsolates " 

toxic polypeptides are discovered, and as sequences respon- 10 A f 9 kb P BamHI fragment carrying the crystal protein 

sible for enhanced cross-expression (expression of a foreign S ene ^ removed from pNSBP544 and inserted into BamHI- 

structural gene in a given host) are elucidated, those of linearized pIC-20H DNA. The resulting plasmid, p405/44-7, 

ordinary skill will be able to select among those elements to is digested with Bglll and religated, thereby removing 

produce "improved" synthetic genes for desired proteins Bacillus sequences flanking the 3'-end of the crystal protein 

having agronomic value. The fundamental aspect of the 15 gene. TTie resulting plasmid, p405/54-12, is digested with 

present invention is the ability to synthesize a novel gene PstI and religated, thereby removing Bacillus sequences 

coding for an insecticidal protein, designed so that the flanking the 5'-end of the crystal protein and about 150 bp 

protein will be expressed at an enhanced level in plants, yet from the 5 '-end of the crystal protein structural gene. The 

so that it will retain its inherent property of insect toxicity resulting plasmid, p405/81-4, is digested with SphJ and PstI 

and retain or increase its specific insecticidal activity. 20 and is mixed with and li gated to a synthetic linker having the 

EXAMPLES following structure: 

The following Examples are presented as illustrations of SD MetThrAia 

scope of this invention, which is determined by the claims. 25 PBt j- 

The following strains were deposited with the Patent 
Culture Collection, Northern Regional Research Center, (SD indicates the location of a Shine-Dalgarno prokaryotic 
1815 N. University Street, Peoria, 111. 61604. ribosome binding site.) The resulting plasmid, p544Pst- 

Met5, contains a structural gene encoding a protein identical 
30 to one encoded by pNSBP544 except for a deletion of the 
amino-terminal 47 amino acid residues. The nucleotide 
sequence of the Btt coding region in p544Pst-Met5 is 
presented in FIG. 1. In bioassays (Sekar and Adang, U.S. 
patent application Ser. No. 108,285, supra), the proteins 



Strain 


Deposited on 


Accession # 


E. coli MCI 061 (p544-HindIII) 
E. coli MCI 061 (p544Pst-Met5) 


6 October 1987 
6 October 1987 


NRRL B-18257 
NRRL B- 18258 



The deposited strains are provided for the convenience of 35 ^ coded b , y ? he Btt S ene 10 P NSBP544 and the 

those in the art, and are not necessary to practice the present N-terminal deleUon derivative, P 544Pst-Met5, were shown 

invention, which may be practiced with the present disclo- !° be !** u * lly t0 , xlc - M] of the P lasmids mentioned above 

sure in combination with publicly available protocols, have their crystal protein genes in the same orientation as the 

information, and materials. £. coli MC1061, a good host for laC „ gene of ™ veclor 

plasmid transformations, was disclosed by Casadaban, M. J. 40 ( u ) Modification of Preferred Codon Usage 

and Cohen, S. N. (1980) J. Mol. Biol. 138:179-207. Table 1 presents the frequency of codon usage for (A) 

dicot proteins, (B) Bt proteins, (C) the synthetic Btt gene, 

Example 1 anc j (5) monocot proteins. Although some codons for a 

Design of the Synthetic Insecticidal Crystal Protein Gene particular amino acid are utilized to approximately the same 

(i) Preparation of Toxic Subclones of the Btt Gene 45 extent by both dicot and Bt proteins (e.g., the codons for 

Construction, isolation, and characterization of pNSB544 serine), for the most part, the distribution of codon fre- 

is disclosed by Sekar, V. et al. (1987) Proc. Natl. Acad. Sci. quency varies significantly between dicot and Bt proteins, as 

USA 84:7036-7040, and Sekar, V. and Adang, M J., U.S. illustrated in columns A and B in Table 1. 



TABLE 1 



Frequency Codon Usage 
Distribution Fraction 



Amino (A)Dicot (BjBt (C)Synthelic (O)Monocot 

Acid Codon Genes Crcncs Btt Gene Genes 



Gly 


GGG 


0.12 


0.08 


0.J 3 


0.21 


Gly 


GGA 


0.38 


0.53 


0.37 


0.17 


Gly 


GOT 


0.33 J 


0.24 


0.34 


0.18 


Gly 


GGC 


0.16 


0.16 


0.16 


0.43 


Glu 


GAG 


0.51 


0.33 


0.52 


0.75 


Glu 


GAA 


0.49 


0.87 


0.48 


0.25 


Asp 


GAT 


0.58 


0.68 


0.56 


0.27 


Asp 


GAC 


0.42 


0.32 


0.44 


0.73 


Val 


GTG 


0.29 


0.15 


0.30 


0.36 


Val 


GTA 


0.12 


0.32 


0.10 


0.08 


Val 


GTT 


0.39 


0.29 


0.35 


0.19 
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TABLE l^coniinued 



Frequency Codon Usage 



vai 


u JL 


0.20 


0.24 


0.25 


0.37 


Ala 


GCG 


0.06 


0.12 


0.06 


0.22 


Ala 


GCA 


0.25 


030 


0.24 


0.16 


Ala 


GOT 


0.42 


032 


0.41 


0.24 


Aia 


GCC 


0.27 


0.06 


0.29 


038 


Lys 


AAG 


0.61 


0.13 


0.58 


0.86 


Lys 


AAA 


039 


0.87 


0.42 


0.14 


Asn 


AAT 


0.45 


0.79 


0.44 


0.25 


Asn 


AAC 


035 


0.21 


036 


0.75 


Met 


ATG 


1.00 


1.00 


1.00 


1.00 


He 


ATA 


0.18 


0.30 


0.20 


0.11 


lie 


ATT 


0.45 


037 


0.43 


0.24 


He 


ATC 


037 


0.13 


0.37 


0.64 


Thr 


ACG 


0.08 


0.14 


0.07 


0.20 


Thr 


ACA 


0.27 


0.68 


0.27 


0.14 


Thr 


ACT 


035 


0.14 


034 


0.19 


Tin- 

I nr 


ACC 


030 


0.05 


0.32 


0.46 


lip 


TGG 


1.00 


1.00 


1.00 


1.00 


End 


TGA 


033 


0.00 


0.00 


0.34 


Cys 


TGT 


0.44 


033 


0.33 


0.30 


Cys 


TGC 


0.56 


0.67 


0,67 


0.70 


End 


TAG 


0.19 


0.00- 


0.00 


0.36 


End 


TAA 


0.48 


1.00 


1.00 


030 


Tyr 


TAT 


0.43 


0.81 


0.43 


0.21 


Tyr 


TAC 


037 


0.19 


0.57 


0.79 


Phe 


TTT 


0.45 


0.75 


0.44 


0.25 


Phe 


TTC 


035 


0.25 


0.56 


0.75 


Scr 


AGT 


0.14 


0.25 


0.13 


0.08 


Ser 


AGC 


0.18 


0.13 


0.19 


0.26 


Ser 


TCG 


0.06 


0.08 


0.06 


0.14 


Ser 


TCA 


0.19 


0.19 


0.17 


0.11 


Ser 


TCT 


0.25 


' 0.25 


0.27 


0.15 


Ser 


TCC 


0.18 


0.10 


0.17 


0.25 


Arg 


AGG 


0.25 


0.09 


0.23 


0.26 


Arg 


AGA 


030 


030 


032 


0.09 


Arg 


CGG 


0.04 


0.14 


0.05 


0.13 


Arg 


CGA 


U.Uo 


0.14 


0,09 


0.04 


Arg 


CGT 


0.21 


0.09 


0.23 


0.12 


Arg 


CGC 


0.11 


0.05 


0.09 


0.36 


Gin 


CAG 


0.41 


0.18 


0.39 


0.46 


GlD 


CAA 


039 


0.82 


0.61 


0.54 


His 


CAT 


034 


0.90 


0.50 


0.33 


His 


CAC 


0.46 


0.10 


0.50 


0.67 


Leu 


TTG 


0.26 


0.08 


0.27 


0.14 


Leu 


TTA 


0.10 


0.46 


0.12 


0.03 


Leu 


CTG 


0.09 


0.04 


0.10 


0.2S 


Leu 


CTA 


0.08 


0.21 


0.10 


0.10 


Leu 


CTT 


0.28 


0.15 


0.18 


0.15 


Leu 


CTC 


0.19 


0.06 


0.22 


0.31 


Pro 


CCG 


0.09 


0.20 


0.08 


0.23 


Pro 


CCA 


0.42 


036 


0.44 


0.34 


Pro 


CCT 


032 


0.24 


032 


0.17 


Pro 


CCC 


0.17 


0.00 


0.16 


0.26 



154 coding sequences of dicot nuclear genes were used to compile the condon usage 
table. The pooled dicot coding sequences, obtained from Genbank (release 55) or, 
when no Genbank file name is specified, directly from the published source, were: 

GENUS/SPECIES GENBANK PROTEIN REI 



Amirtiiinum majus AMACHS 
Arabidopsis thalimia ATHADH 
ATHH3GA 
ATHH3GB 
ATHH4GA 
ATHLHCP1 
ATHTUBA 



Rertholletta cxcelsa 

Brassica campestris 

Brassica mpus BNANAP 

Brassica ol caeca BOLSLSGR 

Cattavalia ensiformis CENCONA 

Carica papaya CPAPAP 

CNamdomonas 



retnliardii 



CREC552 



Chalcone synthetase 
Alcphol dehydrogenase 
Hislone 3 gene 1 
Histone 3 gene 2 
Histone 4 gene 1 
CAB 
a tubulin 

5-enolpyruvyl4hyfate 3-phosphate 1 
synthetase 

High methionine storage protein 2 
Acyl carrier protein 3 
Napin 

S- locus specific glycoprotein 

Concanavalin A 

Papain 

Preapocytochrome 
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TABLE 1 -continued 


Frequency Codon Usage 




CRERBCS1 


RuBPC small subunit gene 1 




CRERBCS2 


RuBPC small subunit gene 2 


Cucurbita pepo 


CUCPHT . 


Phytochrome 


Cucumis sainnts 


CUSGMS 


Glyoxosomal malate synthetase 




CUSLHCPA 


CAB 




CUSSSU 


RuBCP small subunit 


Daucus carota 


DAREXT 


Extensin 




DAREXTR 


33 kD extensin related protein 


Dolidios biflonts 


DBILECS 


seed lectin 


Flaveria trincrvia 


FTRBCR 


RuBPC small subunit 


Glycine max 


SOY7SAA 


7S storage protein 




SOYACT1G 


Actin 1 




SOYCHPI 


CII protease inhibitor 




SOYGLYA1A 


Glycinin Ala Bx subunits 




SOYGLYAAB 


Glycinin A5A4B3 subunits 




SOYGLYAB 


Glycinin A3/b4 subunits 




SOYGLYR 


Glycinin A2Bla subunits 




SOYHSP175 


Low M W heal shock proteins 




SOYLGBI 


leghemoglobin 




SOYLEA 


Lectin 




SOYLOX 


Lipoxygenase 1 




SOYNOD20G 


20 kDa nodulin 




SOYNOD23G 


23 kDa nodulin 




SOYNOD24H 


24 kDa nodulin 




SOYNOD26B 


26 kDa nodulin 




SOYNOD26R 


26 kDa nodulin 




SOYNOD27R 


27 kDa nodulin 




SOYNOD35M 


35 kDa nodulin 




SOYNOD75 


75 kDa nodulin 




SOYNODR1 


Nodulin C51 




SOYNODR2 


Nodulin E27 




SOYPRP1 


Proline rich protein 




SOYRUBP 


RuBPC small subunit 




SOYURA 


Urease 




SOYHSP26A 


Heat shock protein 26A 



Gossypiwn hirsutitm 

Helianthus annus 

Ipomoea batatas 
Lemma gibba 

Lupinus luieus 

Lycopersicon 

esculenatm 



HNNRUBCS 



LGIAB19 

LGIR5BPC 

LUPLBR 

TDMBIOBR 

TOMETHYBR 

TOMPBC2AR 

TOMPSt 

TOMRBCSA 

TDMRBCSB 

TOMRBCSC 

TOMRBCSD 

TOMRRD 

TOMWIPIG 

TOMWIPII 



Medkago sariva 

Mcsembryanthennum 

crystallinum 

Nicotiana 

plunibaghufolia 



nicotiana tabacum 



ALFLB3R 



TOBATP21 



TOBECH 
TDBGAPA 



Nuclear-encoded chloroplast 
heat-shock protein 
22 kDa nodulin 
pi tubulin 
p2 tubulin 

Sted a globulin (vicilin) 

Seed p* globulin (vicilin) 

RuBPC small subunit 

2S albumin seed storage protein 

Wound-induced catalase 

CAB 

RuBPC small subunit 
leghemoglobin I 

Biotin binding protein 
Ethylene biosynthesis protein 
Pol yga lacturonase- 2a 
Tomato photosystem [ protein 
RuBPC small subunit 
RuBPC small subunit 
RuBPC small subunit 
RuBPC small subunit 
Ripening related protein 
Wound induced proteinase 
inhibitor I 

Wound induced proteinase 

inhibitor II 

CAB 1A 

CAB IB 

CAB 3C 

CAB 4 

CAB 5 

Leghemoglobin III 

RuBPC small subunit 

Mitochondrial ATP synthase 
P subunit 
Nitrate reductase 
Glutamine synthetase 
Endochitinase 

A subunit of chloroplast B3PD 



10 
10 
10 

11 
11 



12 



13 
14 
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TABLE 1 -continued 



Frequency Codon Usage 



Perseus americana 
Pctroselinum 
h or tense 
Petunia sp. 



Phascolus vulgaris 



Pisum sativum 



TOBGAPB B subunit of chloroplast G3PD 

TOBGAPC C subunit of chloroplast G3PD 

T0BPR1AR Pathogenesis related protein la 

TOBPR1CR Pathogenesis-related protein lc 

TOBPRPR Pathogenesis related protein lb 

TOBPXDLF Peroxidase 

TOBRBPCO RuBPC small subunit 

TOBTHAUR TMV- induced protein homologous 

to thaumatin 

AVOCEL Cellulase 

PHOCHL Chalconc synthase 

PETCA B13 CAB 13 

PETCA B22L CAB 22L 

PETCA B22R CAB 22R 

PETCA B25 CAB 25 

PETCA B37 CAB 37 

PETCA B91R CAB91R 

P ETC HSR Chalcone synthase 

PETGCR1 Glycine-rich protein 

PETRBCS08 RuBPC small subunit 

PETRBCS11 RuBPC small subunit 

70 LDa heat shock protein 15 

PHVCHM Chintinase 

PHVDLECA Phytohemagglutinin E 

PHVDLECB Phytohemagglutinin L 

PHVGSRl Glutamine synthetase 1 

PHVGSR2 Glutamine synthetase 2 

PHVLBA Leghemoglobin 

PHVLECT Lectin 

PHVPAL Phenylalanine ammonia lyase 

PHVPHASAR aphaseolin 

PHVPHASBR £ phaseolin 

A reel in seed protein 16 
Chalcone synthase 17 

PEAALB2 Seed albumin 

PEACAB80 CAB 

PEAGSR1 Glutamine synthetase (nodule) 

PEALECA Lectin 

PEALEGA Legumin 

PEARUBPS RuBPC small subunit 

PEAVIC2 Vicilin 

PEAVIC4 Vicilin 

PEAVIC7 VicUin 

Alcohol dehydrogenase 1 18 
Glutamine synthetase (leaf) 19 
Glutamine synthetase (root) 19 
Histone 1 20 
Nuclear encoded chloroplast 4 
heat shock proetin 

RuBPC small subunit 21 
RCCAGG Agglutinin 
RCCRICIN Ridn 
RCCICL4 Isocitrate lyase 

SIPFDX Ferrodoxin precursor 

SIPPCY Plastocyanin precursor 

SALGAPDH Nuclear gene for G3PD 
POTPAT Patatin 
POT1NHWI Wound- induced proteinase 

inhibitor 

POTLS1G Light -inducible tissue specific 

ST-LS1 gene 
POTP12G Wound- induced proteinase 

inhibitor II 
POTRBCS RuBPC small subunit 

Sucrose synthetase 22 
SPIACPt Acyl carrier protein I 

SPIOEC16 16 kDa photosynthetic 

oxygen-evolving protein 
SPIOEC23 23 kDa photosynthetic 

oxygen-evolving protein 
SPIPCG Plastocyanin 
SPIPS33 33 kDa photosynthetic water 

oxidation complex precursor 

Glycolate oxidase 23 



Raphanus satirvus 
Ricimts communis 



Silene pratertsis 

Sinapis alba 
Soiatutm tuberosum 



Sfivacia oleracca 
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TABLE 1 -continued 



Frequency Codon Usage 


Vicia faba 


VFALBA Leghemoglobin 






VFALEB4 Legumin B 






Vicillin 


24 



Pooled 53 monocot coding sequences obtained from Gcnbank (release 55) 
or, when no Genbank file name is specified, directly from the published 
source, were: 



GENUS/SPECIES GENBANK PROTEIN 



Avena sativa ASTAP3R 
Hordeunt vulgar* BLYALR 
BLYAMY1 
BLYAMY2 
BLYCHORD1 
BLYGLUCB 
BLYHORB 
BLYPAPI 
BLYTH1AR 
BLYUBIQR 



Oryia sativa RICGLUTG 

Triticum aesavum WHTAMYA 
WHTCAB 
WHTEMR 
WHTGIR 
WHTGLGB 
WHTGLIABA 
WHTGLUTl 
WHTH3 
WHTH4091 
WHTRBCB 
Secale cereale RYESECGSR 
Zea mays MZEA1G 

MZEACT1G 

MZEADH11F 

MZEADH2NR 

MZEALD 

MZEANT 

MZEEG2R 

MZEGGST3B 

MZEH3C2 

MZEH4C14 

MZEHSP701 

MZEHSP702 

MZELHCP 

MZEMPL3 

MZEPEPCR 

MZERBCS 

MZESUSYSG 

MZETP12 

MZEZEA20M 

MZEZEA30M 

MZEZE15A3 

MZEZE16 

MZEZE19A 

MZEZE22A 

MZEZE22B 



Phyto chrome 3 




Alcurain 




rv fimvtftc^ "1 
u oinyia&c 1 




a amylase 2 




Hordein C 

(K ctl 1 1 r*a iw 








Amylase/protease inhibitor 




Toxin ct hordothiooin 




Ubiquitio 




Histone 3 


25 


Leaf snecifir fhtnmn 1 


1A 


1 /> a f enfVMfir* thinnin 1 
LIllUnLll w 


10 


Plastocy&nin 


27 


Glutelin 




Glutelin 


iff 


01 amylase 




CAB 




Fin n tt\ f ^ i n 




gibberellin responsive protein 




f gliadin 




rt/R nlia/lin ria« All 

u/p giiaam vJaSS 




nign rvi w giuiemn 












RnRPO email citVitmi't 




J A^I><11 ill 




40.1 kD Al protein (NADPH- 




dependent reductase) 




Actin 




Alcohol dehydrogenase 1 




Alcohol dehydrogenase 2 




Aldolase 




ATP/ADP translocator 




Glutelin 2 




Glutathione S transferase 




Histone 3 




Histone 4 




70 kD Heat shock protein, exon 1 




70 kD Heat shock protein, exon 2 




CAB 




Lipid body surface protein L3 




Phosphoenolyruvate carboxylase 




RuBPC small summit 




Sucrose synthetase 




Triosephosphate isomerase 1 




19 kD zein 




19 kD zein 




15 kD zein 




16 kD zein 




19 kD zein 




22 kD zein 




22 kD zein 




Catalasc 2 


29 


Regulatory CI locus 


30 



Bt condons were obtained from analysis of coding sequences of the following genes: Bt 
var. kurstaki HD-73, 6.6 kb HindHI fragment (Kronstad et al. (1983) J. Bacleriol. 
154:419-^28): Bt var. kurstaki HD-1, 5.3 kb fragment (Adang et al. (1987) in Biotech- 
nology in Invertebrate Pathology and Cell Culture, K. Maramorosh (ed.), Academic Press, 
Inc. New York, pp. 85-99); Bt van kurstaki HD-1, 4.5 kb fragment (Schnepf and Whiteley 
(1985)J. Biol. Chem. 260:6273-6280); and Bt var. tenebrionis, 3.0 kb HindBJ fragment 
(Sekar et al. (1987) Proc. Natl. Acad. Sci. 84:7036-7040. 
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For example, dicots utilize the AAG codon for lysine with 
a frequency of 61% and the AAA codon with a frequency of 
39%. In contrast, in Bt proteins the lysine codons AAG and 
AAA are used with a frequency of 13% and 87%, respec- 
tively. It is known in the art that seldom used codons are 
generally detrimental to that system and mast be avoided or 
used judiciously. Thus, in designing a synthetic gene encod- 
ing the Btt crystal protein, individual amino acid codons 
found in the original Btt gene are altered to reflect the codons 
preferred by dicot genes for a . particular amino acid. 
However, attention is given to maintaining the overall 
distribution of codons for each amino acid within the coding 
region of the gene. For example, in the case of alanine, it can 
be seen from Table 1 that the codon GCA is used in Bt 
proteins with a frequency of 50%, whereas the codon GCT 
is the preferred codon in dicot proteins. In designing the 
synthetic Bit gene, not all codons for alanine in the original 
Bt gene are replaced by GCT; instead, only some alanine 
codons arc changed to GCT while others are replaced with 
different alanine codons in an attempt to preserve the overall 
distribution of codons for alanine used in dicot proteins. 
Column C in Table 1 documents that this goal is achieved; 
the frequency of codon usage in dicot proteins (column A) 
corresponds very closely to that used in the synthetic Btt 
gene (column C). 

In similar manner, a synthetic gene coding for insecticidal 
crystal protein can be optimized for enhanced expression in 
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monocot plants. In Table 1, column D, is presented the 
frequency of codon usage of highly expressed monocot 
proteins. 

Because of the degenerate nature of the genetic code, only 
part of the variation contained in a gene is expressed in this 
protein. It is clear that variation between degenerate base 
frequencies is not a neutral phenomenon since systematic 
codon preferences have been reported for bacterial, yeast 
and mammalian genes. Analysis of a large group of plant 
gene sequences indicates that synonymous codons are used 
differently by monocots and dicots. These patterns are also 
distinct from those reported for E. coli, yeast and man. 

In general, the plant codon usage pattern more closely 
resembles that of man and other higher eukaryotes than 
unicellular organisms, due to the overall preference for G+C 
content in codon position III. Monocots in this sample share 
the most commonly used codon for 13 of 18 amino acids as 
that reported for a sample of human genes (Grantham et al. 
(1986 supra), although dicots favor the most commonly used 
human codon in only 7 of 18 amino acids. 

Discussions of plant codon usage have focused on the 
differences between codon choice in plant nuclear genes and 
in chloroplasts. Chloroplasts differ from higher plants in that 
they encode only 30 tRNA species. Since chloroplasts have 
restricted their tRNA genes, the use of preferred codons by 
chloroplast-encoded proteins appears more extreme. 
However, a positive correlation has been reported between 
the level of isoaccepting tRNA for a given amino acid and 
the frequency with-which this codon is used in the chloro- 
plast genome (Pfitzinger et al. (1987) Nucl. Acids Res. 
15:1377-1386). 

Our analysis of the plant genes sample confirms earlier 
reports that the nuclear and chloroplast genomes in plants 
have distinct coding strategies. The codon usage of mono- 
cots in this sample is distinct from chloroplast usage, sharing 
the most commonly used codon for only 1 of 18 amino acids. 
Dicots in this sample share the most commonly used codon 
of chloroplasts in only 4 of 18 amino acids. In general, the 
chloroplast codon profile more closely resembles that of 
unicellular organisms, with a strong bias towards the use of 
A+T in the degenerate third base. 

In unicellular organisms, highly expressed genes use a 
smaller subset of codons than do weakly expressed genes 
although the codons preferred are distinct in some cases. 
Sharp and Li (1986) Nucl. Acids Res. 14:7734-7749 report 
that codon usage in 165 E. coli genes reveals a positive 
correlation between high expression and increased codon 
bias. Bennetzen and Hall (1982) supra have described a 
similar trend in codon selection in yeast. Codon usage in 
these highly expressed genes correlates with the abundance . 
of isoaccepting tRNAs in both yeast and E coli. It has been 
proposed that the good fit of abundant yeast and E. coli 
mRNA codon usage to isoacceptor tRNA abundance pro- 
motes high translation levels and high steady slate levels of 
these proteins. This strongly suggests that the potential for 
high levels of expression of plant genes in yeast or E. coli 
is limited by their codon usage. Hoekema et al. (1987) supra 
report that replacement of the 25 most favored yeast codons 
with rare codons in the 5' end of the highly expressed gene 
PGK1 leads to a decrease in both mRNA and protein. These 
results indicate that codon bias should be emphasized when 
engineering high expression of foreign genes in yeast and 
other systems. 

(iii) Sequences within the Btt Coding Region having 
Potentially Destabilizing Influences 

Analysis of the Btt gene reveals that the A+T content 
represents 64% of the DNA base composition of the coding 
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region. This level of A+T is about 10% higher than that 
found in a typical plant coding region. Most often, high A+T 
regions are found in intergenic regions. Also, many plant 
regulatory sequences are observed to be AT-rich. These 
observations lead to the consideration that an elevated A+T 5 
content within the Btt coding region may be contributing to 
a low expression level in plants. Consequently, in designing 
a synthetic Btt gene, the A+T content is decreased to more 
closely approximate the A+T levels found in plant proteins. 
As illustrated in Table 3, the A+T content is lowered to a ao 
level in keeping with that found in coding regions of plant 
nuclear genes. The synthetic Bit gene of this invention has 
an A+T content of 55%. 



TABLE 3 


Adenine + Thymine Content in Rtt CrxKno R^i™ 






Base 




Coding Region 


G A T C %G+C 


% A+-T 


Natural Btt gene 
Synthetic Btt gene 


341 633 514 306 36 
392 530 483 428 45 


64 

55 



In addition, the natural Btt gene is scanned for sequences 
that are potentially destabilizing to Btt RNA. These 2 
sequences, when identified in the original Btt gene, are 
eliminated through modification of nucleotide sequences. 
Included in this group of potentially destabilizing sequences 
are: 

(a) plant polyadenylation signals (as described by Joshi 
(1987) NucL Acids Res. 15:9627-9640). In eukaryotes, 
the primary transcripts of nuclear genes are extensively 
processed (steps including 5' — capping, intron spacing, 
polyadenylation) to form mature and translatable 3 
mRNAs. In higher plants, polyadenylation involves 
eadonucleolytic cleavage at the polyAsite followed by 
the addition of several A residues to the cleaved end. 
The selection of the polyA site is presumed to be 
cis-regulated. During expression of Bt protein and RNA ^ 
in different plants, the present inventors have observed 
that the polyadenylated mRNA isolated from these 
expression systems is not full-length but instead is 
truncated or degraded. Hence, in the present invention 
it was decided to minimize possible destabilization of 4 , 
RNA through elimination of potential polyadenylation 
signals within the coding region of the synthetic Btt 
gene. Plant polyadenylation signals including 
AATAAA, AATGAA, AATAAT, AATAIT, GATAAA, 
GATAAA, and AATAAG motifs do not appear in the 5C 
synthetic Btt gene when scanned for 0 mismatches of 
the sequences. 

(b) polymerase II termination sequence, CAN 7 . 
9AGTNNAA. This sequence was shown (Vankan and 
Filipowicz (1988) EMBO J. 7:791-799) to be next to 55 
the 3' end of the coding region of the U2 snRNA genes 
otArabidopsis th a liana and is believed to be important 
for transcription termination upon 3' end processing. 
The synthetic Btt gene is devoid of this termination 
sequence. 60 

(c) CUUCGG hairpins, responsible for extraordinarily 
stable RNA secondary structures associated with vari- 
ous biochemical processes (Tuerk et al. (1988) Proc. 
Natl. Acad. Sci. 85:1364-1368). The exceptional sta- 
bility of CUUCGG hairpins suggests that they have an 65 
unusual structure and may function in organizing the 
proper folding of complex RNA structures. CUUCGG 
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hairpin sequences are not found with either 0 or 1 
mismatches in the Btt coding region, 
(d) plant consensus splice sites, 5'=AAG:GTAAGT (SEQ 
ID NO. 6) and 3'=TITr(Pu)TTr(Pu)T(Pu)lXlMT(Pu) 
TGCAG:C (SEQ ID NO. 7), as described by Brown et 
al. (1986) EMBO J. 5:2749-2758. Consensus 
sequences for the 5 ! and 3' splice junctions have been 
derived from 20 and 30 plant intron sequences, respec- 
tively. Although it is not likely that such potential splice 
sequences are present in Bt genes, a search was initiated 
for sequences resembling plant consensus splice sites in 
the synthetic Btt gene. For the 5' splice site, the closest 
match was with three mismatches. This gave 12 
sequences of which two had G:GT. Only position 948 
was changed because 1323 has the Kpnl site needed for 
reconstruction. The 3'-splice site is not found in the 
synthetic Btt gene. 
Thus, by highlighting potential RNA-destabilizing 
sequences, the synthetic Btt gene is designed to eliminate 
known eukaryotic regulatory sequences that affect RNA 
synthesis and processing. 

Example 2 

Chemical Synthesis of a Modified Btt Structural Gene 
(i) Synthesis Strategy 

The general plant for synthesizing linear double-stranded 
DNA sequences coding for the crystal protein from Btt is 
schematically simplified in FIG. 2. The optimized DNA 
coding sequence (FIG. 1) is divided into thirteen segments 
(segments A-M) to be synthesized individually, isolated and 
purified. As shown in FIG. 2, the general strategy begins by 
enzymatically joining segments A and M to form segments 
AM to which is added segment BL to form segment ABLM . 
Segment CK is then added enzymatically to make segment 
ABCKLM which is enlarged through addition of segments 
DJ, EI and RFH sequentially to give finally the total segment 
ABCDEFGHIJKLM, representing the entire coding region 
of the Btt gene. 

FIG. 3 outlines in more detail the strategy used in com- 
bining individual DNA segments in order to effect the 
synthesis of a gene having unique restriction sites integrated 
into a defined nucleotide sequence. Each of the thirteen 
segments (A to M) has unique restriction sites at both ends, 
allowing the segment to be strategically spliced into a 
growing DNA polymer. Also, unique sites are placed at each 
end of the gene to enable easy transfer from one vector to 
another. 

The thirteen segments (A to M) used to construct the 
synthetic gene vary in size. Oligonucleotide pairs of 
approximately 75 nucleotides each are used to construct 
larger segments having approximately 225 nucleotide pairs. 
FIG. 3 documents the number of base pairs contained within 
each segment and specifies the unique restriction sites 
bordering each segment. Also, the overall strategy to incor- 
porate specific segments at appropriate splice sites is 
detailed in FIG. 3. 
(ii) Preparation of Oligodeoxynucleotides 
Preparation of oligodeoxynucleotides for use in the syn- 
thesis of a DNA sequence comprising a gene for Btt is 
carried out according to the general procedures described by 
Matleucci et al. (1981) J. Am. Chem. Soc. 103:3185-3192 
and Beaucage et al. (1981) Tetrahedron Lett. 22:1859-1862. 
All oligonucleotides are prepared by the solid-phase phos- 
phoramidite triester coupling approach, using an Applied 
Biosystems Model 380A DNA synthesizer. Deprotection 
and cleavage of the oligomers from the solid support are 
carried out according to standard procedures. Crude oligo- 
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nucleotide mixtures are purified using an oligonucleotide 
purification cartridge (OTC, Applied Biosy stems) as 
described by McBridc el al. (1988) Biotechniques 
6:362-367. 

.^'-phosphorylation of oligonucleotides is performed with 
T4 polynucleotide kinase. The reaction contains 2 /<g oli- 
gonucleotide and 18.2 units polynucleotide kinase 
(Pharmacia) in linker kinase buffer (Maniatis (1982) Clon- 
ing Manual, Fritsch and Sambrook (eds.) Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y.). The reaction 
is incubated at 37° C for 1 hour. 
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M segment DNA are used to transform E. coli MC1061. 
Colonies containing inserted blocks are identified by colony 
hybridization with 32 P-labeIled oligonucleotide probes. The 
sequence of the DNA segment is confirmed by isolating 
plasmid DNA and sequencing using the dideoxy method of 
Sanger et al. (1977) Proc. Natl. Acad. Sci. 74:5463-5467. 
(iii) Synthesis of Segment AM 

Three oligonucleotide pairs (Al and its complementary 
strand Ale, A2 and A2c and A3 and A3c) are assembled and 
ligated as described above to make up segment A. The 
nucleotide sequence of segment A is as follows: 



TABLE 4 



Nucleotide Sequence of Segment A 



BamHI 

XhoII PstI 
I I 
AATTGGGATCCAAC AATGGC TGCAGACAAC AACACGGAGGCCCTCGATAGCTCTACCACC 
1 + + + + + + 6Q 

CCCTAGOTTGTTACCGACGTCTGTTGTTGTGCCTCCGGGAGCTATCGAGATGGTGG 
EcoRl end 

MAADNNTEALDSSTT 



AAAGATGTCATlrcAGAAGGGCATCTCCGTTGTGGGTGAT^ 

61 + ' l| + + + + 120 

T TT C T AC AGTAAGTCTT CC C^TAGAGGCAAC AC CC AC TAGAGGAACC GC AACAAC CAAAG 

KDVIQKG ISVVGDLLGVVGF 



AL (71 bases) 
Ale* (76 bases) 



BspXII 



CCC TTTGGTGGTGCCCTTGTTTCI 
121 - 



ITACACTAACTTTCTGAATACTATTTGGCCCAGC 



GGGAAACCACCACGGGAACAAAGCAAGATGTGATTC^VAAGACTTATGATAAACCGGGTCG 
PFGGALVSPYTNFLNTIWPS 



A2 (75 bases) 
A2c (76 bases) 



181 — 



XhoII 

Bglll EcoRI 
Hindu I I Xbal I 

I I I I end 

. G AAGACCCTTGGAAGGC TTTT ATGGAGCAAGTGGAAGCTTAGATCTAG 



CTTCTGGGAACCTTCCGAAAATACCTCGTTCACCTTCGAATCTAGATCTTAA 
EDPWKAFMEQVE 



232 



A3 (82 bases) 
A3c (76 bases) 



*c = complementary strand. 



Oligonucleotides arc annealed by first heating to 95° C. 
for 5 min. and then allowing complementary pairs to cool 
slowly to room temperature. Annealed pairs are reheated to 
65° C, solutions are combined, cooled slowly to room 
temperature and kept on ice until used. The ligated mixture 
may be purified by electrophoresis through a 4% NuSieve 
agarose (FMC) gel. The band corresponding to the ligated 
duplex is excised, the DNA is extracted from the agarose and 
ethanol precipitated. 

Ligations are carried out as exemplified by that used in M 
segment ligations. M segment DNA is brought to 65° C. for 
25 min, the desired vector is added and the reaction mixture 
is incubated at 65° C. for 15 rain. The reaction is slow cooled 
over VA hours to room temperature. ATP to (K5 mM and 3.5 
units of T4 DNA ligase salts are added and the reaction 
mixture is incubated for 2 hr at room temperature and then 
maintained overnight at 15° C. The next morning, vectors 
which had not been ligated to M block DNA were removed 
upon linearization by EcoRI digestion. Vectors ligated to the 
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In Table 4, bold lines demarcate the individual oligonucle- 
otides. Fragment Al contains 71 bases, Ale has 76 bases, A2 
has 75 bases, A2c has 76 bases, A3 has 82 bases and A3c has 
76 bases. In all, segment A is composed of 228 base pairs 
55 and is contained between EcoRI restriction enzyme site and 
one destroyed EcoRI site (5')X (Additional restriction sites 
within Segment A are indicated.) The EcoRI single-stranded 
cohesive ends allow segment A to be annealed and then 
ligated to the EcoRI-cut cloning vector, pIC20K. 

60 

Segment M comprises three oligonucleotide pairs: Ml, 80 
bases, Mlc, 86 bases, M2, 87 bases, M2c, 87 bases, M3, 85 
bases and M3c 79 bases. The individual oligonucleotides are 

65 annealed and ligated according to standard procedures as 
described above. The overall nucleotide sequence of seg- 

. raent M is: 
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TABLE 5 



Nucleotide Sequence of Segment M 



Hindlll BspXII 
Eco I Banlll 

Rl-end I | | 

AATTAAGCTTGGACGGGGCTCCATTCAACCAATACTACTTCGATAAGACCATCARCAAAG 
1 + + + + 4 4 60 

TTCGAACCTGCCCCGAGGTAAGTTGGTTATGATGAAGCTATTCTGGTAGTTGTTTC 



SLDGAPFNQYYFDKTINKG 



ASUlI 

. I 

GAGACACACTCACGTATAATtrCCTTCAACTTAGCCAGCTTCAGCACTCCATTCGAATTGT 

61 + \ \ i. 4 + + 120 

CTC TGTGTG AGTCCATATTAAGG AAGTTGi^TCGGTC GAAGTCGTGAGGTAAGCTTAAC A 

DTLTYNSFNLASFST PFEL S 



AccI 

Ahall TthI I 

I I I 

CAGGGAACAACTTGCAGATAGGCGTCACAGGATTGAGTGCTGGTGACtvAGGTCTACATCG 

121 + + + 4 II |-4 180 

GTCCCTTGTTGAACGTCTATCCGCAGTGTCCTAACTCACGACCACTGTTCCAGATGaRGC 

GNNLQIGVTGLSAGDKVYID 



Ml (80 bases) 
Mlc* (86 bases) 



M2 (87 bases) 
N2c (87 bases) 



Mstll 
I 

ACAAGATTGAGTTCATTCCAGTGAACCTTAGGTCCCCAGGAACCGAGCTTGAGTTCA^ 

181 + + + 4 + + 24 0 

TGTTCTAACTCAAGTAAGGTCACTTGGAATCCAGGGGTCCTTGGCTCGAACTCAAGTAGC 

KIEFIPVNLRSPGTELEFID 



Bglll 
Xholl 
Xbal I 
I I 
ACATCTAGATCT 

241 + 256 

TGTAGATCTAGATTAA 



M3 (85 bases) 
M3c (79 bases) 



252 BASES (TOTAL) 



*c « complementary strand 

In Table 5 bold lines demarcate the individual oligonucle- 
otides. Segment M contains 252 base pairs and has 
destroyed EcoRI, restriction sites at both ends. (Additional 
restriction sites within segment M are indicated). Segment 
M is inserted into vector pIC20R at an EcoRI restriction site 
and cloned. 

As proposed in FIG. 3, segment M is joined to segment A 
in the plasmid in which it is contained. Segment M is excised 
at the flanking restrictions sites from its cloning vector and 
spliced into pIC20K, harboring segment A, through succes- 
sive digestions with Hindlll followed by Bqlll. The pIC20K 
vector now comprises segment A joined to segment M with 
a Hindlll site at the splice site (see FIG. 3). Plasmid pIC20K 
is derived from pIC20R by removing the Scal-Ndel DNA 
fragment and inserting a Hindi fragment containing an 
NPTI coding region. The resulting plasmid of 4.44 kb 
confers resistance to kanamycin on E. coli. 

Example 3 

Expression of Synthetic Crystal Protein Gene in Bacterial 
Systems 

The synthetic Btt gene is designed so that it is expressed 
in the pIC20R-kan vector in which it is constructed. This 



50 



45 

expression is produced utilizing the initiation methionine of 
the lacZ protein of pIC20K. The wild-type Btt crystal protein 
sequence expressed in this manner has full insecticidal 
activity. In addition, the synthetic gene is designed to 
contain a BamHI site 5' proximal to the initiating methionine 
codon and a Bglll site 3 f to the terminal TAG translation stop 
codon. This facilitates the cloning of the insecticidal crystal 
protein coding region into bacterial expression vectors such 
55 as pDR540 (Russell and Bennett, 1982). Plasmid pDR540 
contains the TAG promoter which allows the production of 
proteins including Btt crystal protein under controll&d con- 
ditions in amounts up to 10% of the total bacterial protein. 
This promoter functions in many gram-negative bacteria 
including E. coli and Pseudomonas. 

Production of Bt insecticidal crystal protein from the 
synthetic gene in bacteria demonstrates that the protein 
produced has the expected toxicity to coleopteran insects. 
These recombinant bacterial strains in themselves have 
potential value as microbial insecticides, product of the 
synthetic gene. 



60 



65 
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Example 4 

Expression of a Synthetic Crystal Protein Gene in Plants 

The synthetic Btt crystal protein gene is designed to 
facilitate cloning into the expression cassettes. These utilize 
sites compatible with the Bam HI and Ball I restriction sites 5 
flanking the synthetic gene. Cassettes are available that 
utilize plant promoters including CaMV 35 S, CaMV 19S 
and the ORF 24 promoter from T-DNA. These cassettes 
provide the recognition signals essential for expression of 
proteins in plants. These cassettes are utilized in the micro 10 
Ti plasm ids such as pH575. Plasmids such as pH575 con- 
taining the synthetic Btt gene directed by plant expression 
signals are utilized in disarmed Agrobacterium tumefaciens 
to introduce the synthetic gene into plant genomic DNA. 
This system has been described previously by Adang et al. 15 
(1987) to express Bt var. kurstaki crystal protein gene in 
tobacco plants. These tobacco plants were toxic to feeding 
tobacco hornworms. 
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Example 5 
Assay for Insecticidal Activity 

Bioassays are conducted essentially as described by 
Sekar, V. et al. supra. Toxicity is assessed by an estimate of 
the LD 50 . Plasmids are grown in E. coli JM105 (Yanisch- 
Perron, C. et al. (1985) Gene 33: 103-119). On a molar basis, 
no significant differences in toxicity are observed between 
crystal proteins encoded by p544Pst-Met5, p544-HindIII, 
and pNSBP544. When expressed in plants under identical 
conditions, cells containing protein encoded by the synthetic 
gene are observed to be more toxic than those containing 
protein encoded by the native Btt gene. Immunoblots 
("western" blots) of cell cultures indicated that those that are 
more toxic have more crystal protein antigen. Improved 
expression of the synthetic Btt gene relative to that of a 
natural Btt gene is seen as the ability to quantitate specific 
mRNA transcripts from expression of synthetic Btt genes on 
Northern blot assays. 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(iii) NUMBER OF SEQUENCES! 9 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



ATGACTGCAG 


ATAATAATAC 


GGAAGCACTA 


GATAGCTCTA 


CAACAAAAGA 


TGTCATTCAA 


60 


AAAGGCATTT 


CCGTAGTAGG 


TGATCTCCTA 


GGCGTAGTAG 


GTTTCCCGTT 


TGGTGGAGCG 


120 


CTTGTTTCGT 


TTTATACAAA 


CTTTTTAAAT 


ACTATTTGGC 


CAAGTGAAGA 


CCCGTGGAAG 


180 


GCTTTTATGG 


AACAAGTAGA 


AGCATTGATG 


GATCAGAAAA 


TAGCTGATTA 


TGCAAAAAAT 


240 


AAAGCTCTTG 


CAGAGTTACA 


GGGCCTTCAA 


AATAATGTCG 


AAGATTATGT 


GAGTGCATTG 


300 


AGTTCATGGC 


AAAAAAATCC 


TGTGAGTTCA 


CGAAATCCAC 


ATAGCCAGGG 


GCGGATAAGA 


360 


GAGCTGTTTT 


CTCAAGCAGA 


AAGTCATTTT 


CGTAATTCAA 


TGCCTTCGTT 


TGCAATTTCT 


420 


GGATACGAGG 


TTCTATTTCT 


AACAACATAT 


GCACAAGCTG 


CCAACACACA 


TTTATTTTTA 


480 


CTAAAAGACG 


CTCAAATTTA 


TGGAGAAGAA 


TGGGGATACG 


AAAAAGAAGA 


TATTGCTGAA 


540 


TTTTATAAAA 


GACAACTAAA 


ACTTACGCAA 


GAATATACTG 


ACCATTGTGT 


CAAATGGTAT 


600 


AATGTTGGAT 


TAGATAAATT 


AAGAGGTTCA 


TCTTATGAAT 


CTTGGGTAAA 


CTTTAACCGT 


660 


TATCGCAGAG 


AGATGACATT 


AACAGTATTA 


GATTTAATTG 


CACTATTTCC 


ATTGTATGAT 


720 


GTTCGGCTAT 


ACCCAAAAGA 


AGTTAAAACC 


GAATTAACAA 


GAGACGTTTT 


AACAGATCCA 


780 


ATTGTCGGAG 


TCAACAACCT 


TAGGGGCTAT 


GGAACAACCT 


TCTCTAATAT 


AGAAAATTAT 


840 


ATTCGAAAAC 


CACATCTATT 


TGACTATCTG 


CATAGAATTC 


AATTTCACAC 


GCGGTTCCAA 


"900 


CCAGGATATT 


ATGGAAATGA 


CTCTTTCAAT 


TATTGGTCCG 


GTAATTATGT 


TTCAACTAGA 


960 


CCAAGCATAG 


GATCAAATGA 


TATAATCACA 


TCTCCATTCT 


ATGGAAATAA 


ATCCAGTGAA 


1020 


CCTGTACAAA 


ATTTAGAATT 


TAATGGAGAA 


AAAGTCTATA 


GAGCCGTAGC 


AAATACAAAT 


1080 


CTTGCGGTCT 


GGCCGTCCGC 


TGTATATTCA 


GGTGTTACAA 


AAGTGGAATT 


TAGCCAATAT 


1140 
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-continued 

AATGATCAAA CAGATGAAGC AAGTACACAA ACGTACGACT CAAAAAGAAA TGTTGGCGCG 1200 

GTCAGCTGGG ATTCTATCGA TCAATTGCCT CCAGAAACAA CAGATGAACC TCTAGAAAAG 1260 

GGATATAGCC ATCAACTCAA TTATGTAATG TGCTTTTTAA TGCAGGGTAG TAGAGGAACA 1320 

ATCCCAGTGT TAACTTGGAC ACATAAAAGT GTAGACTTTT TTAACATGAT TGATTCGAAA 1380 

AAAATTACAC AACTTCCGTT AGTAAAGGCA TATAAGTTAC AATCTGGTGC TTCCGTTGTC 1440 

GCAGGTCCTA GGTTTACAGG AGGAGATATC ATTCAATGCA CAGAAAATGG AAGTGCGGCA 1500 

ACTATTTACG TTACACCGGA TGTGTCGTAC TCTCAAAAAT ATCGAGC TAG AATTCATTAT 1560 

GCTTCTACAT CTCAGATAAC ATTTACACTC AGTTTAGACG GGGCACCATT TAATCAATAC 1620 

TATTTCGATA AAACGATAAA TAAAGGAGAC ACATTAACGT ATAATTCATT TAATTTAGCA 1680 

AGTTTCAGCA CACCATTCGA ATTATCAGGG AATAACTTAC AAATAGGCGT CACAGGATTA 174 0 

AGTGCTGGAG ATAAAGTTTA TATAGACAAA ATTGAATTTA TTCCAGTGAA TTAA 1794 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys 
15 10 15 

Asp Val He Gin Lys Gly He Ser Val Val Gly Asp Leu Leu Gly Val 
20 25 30 

Val Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe 
35 40 - 45 

Leu Asn Thr He Trp Pro Ser Glu Asp Pro Trp Lye Ala Phe Met Glu 
50 55 60 

Gin Val Glu Ala Leu Met Asp Gin Lys He Ala Asp Tyr Ala Lys Asn 
65 70 75 80 

Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Val Glu Asp Tyr 
85 90 95 

Val Ser Ala Leu Ser Ser Trp Gin Lys Asn Pro Val Ser Ser Arg Asn 
100 105 HO 

Pro His Ser Gin Gly Arg He Arg Glu Leu Phe Ser Gin Ala Glu Ser 
115 120 125 

Hie Phe Arg Asn Ser Met Pro Ser Phe Ala He Ser Gly Tyr Glu Val 
130 135 140 

Leu Phe Leu Thr Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Phe Leu 
145 150 155 160 

Leu Lys Asp Ala Gin He Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu 
165 170 175 

Asp He Ala Glu Phe Tyr Lys Arg Gin Leu Lys Leu Thr Gin Glu Tyr 
180 185 190 

Thr Asp His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg 
195 200 205 

Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu 
210 215 220 

Met Thr Leu Thr Val Leu Asp Leu He Ala Leu Phe Pro Leu Tyr Asp 
225 230 235 240 
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Val Arg Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val 
245 250 255 

Leu Thr Asp Pro He Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr 
260 265 270 

Thr Phe Ser Asn He Glu Asn Tyr He Arg Lys Pro His Leu Phe Asp 
275 280 285 

Tyr Leu His Arg He Gin Phe His Thr Arg Phe Gin Pro Gly Tyr Tyr 
290 295 300 

Gly Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg 
305 310 315 320 

Pro Ser lie Gly Ser Asn Asp He He Thr Ser Pro Phe Tyr Gly Asn 
325 330 335 

Lys Ser Ser Glu Pro Val Gin Asn Leu Glu Phe Asn Gly Glu Lys Val 
340 345 350 

Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val 
355 360 365 

Tyr Ser Gly Val Thr Lys Val Glu Phe Ser Gin Tyr Asn Asp Gin Thr 
370 375 380 

Asp Glu Ala Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala 
385 390 395 400 

Val Ser Trp Asp Ser He Asp Gin Leu Pro Pro Glu Thr Thr Asp Glu 
405 410 415 

Pro Leu Glu Lys Gly Tyr Ser His Gin Leu Asn Tyr Val Met Cys Phe 
420 425 430 

Leu Met Gin Gly Ser Arg Gly Thr He Pro Val Leu Thr Trp Thr His 
435 440 445 

Lys Ser Val Asp Phe Phe Asn Met He Asp Ser Lys Lys He Thr Gin 
450 455 460 

Leu Pro Leu Val Lys Ala Tyr Lys- Leu Gin Ser Gly Ala Ser Val Val 
465 470 475 480 

Ala Gly Pro Arg Phe Thr Gly Gly Asp He He Gin Cys Thr Glu Asn 
485 490 495 

Gly Ser Ala Ala Thr He Tyr Val Thr Pro Asp Val Ser Tyr Ser Gin 
500 505 510 

Lys Tyr Arg Ala Arg He His Tyr Ala Ser Thr Ser Gin He Thr Phe 
515 520 525 

Thr Leu Ser Leu Asp Gly Ala Pro Phe Asn Gin Tyr Tyr Phe Asp Lys 
530 535 540 

Thr He Asn Lys Gly Asp Thr Leu Thr Tyr Asn Ser Phe Asn Leu Ala 
545 550 555 560 

Ser Phe Ser Thr Pro Phe Glu Leu Ser Gly Asn Asn Leu Gin He Gly 
565 570 575 

Val Thr Gly Leu Ser Ala Gly Asp Lys Val Tyr lie Asp Lys He Glu 
580 585 590 

Phe He Pro Val Asn 
595 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1833 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

ATGGCTGCAG ACAACAACAC GGAGGCCCTC GATAGCTCTA CCACCAAAGA TGTCATTCAG 60 

AAGGGCATCT CCGTTGTGGG TGATCTCCTT GGCGTTGTTG GTTTCCCCTT TGGTGGTGCC 120 

CTTGTTTCGT TCTACACTAA CTTTCTGAAT ACTATTTGGC CCAGCGAAGA CCCTTGGAAG 180 

GCTTTTATGG AGCAAGTGGA AGCTTTGATG GATCAGAAGA TCGCTGATTA TGCAAAGAAC 240 

AAAGCTCTTG CTGAGCTCCA GGGCCTTCAG AACAACGTCG AAGATTATGT GAGTGCACTG 300 

AGTTCATGGC AAAAGAATCC TGTGTCCTCA CGAAATCCAC ATAGCCAGGG GCGCATAAGG 360 

GAGCTGTTCT CTCAAGCAGA AAGTCACTTC CGGAATTCAA TGCCTTCCTT TGCCATCTCT 420 

GGGTACGAGG TTCTCTTTCT TACAACCTAC GCTCAAGCTG CCAACACACA TCTGTTCTTA 480 

CTAAAAGACG C TC AAATCTA TGGTGAAGAA TGGGGATACG AGAAAGAAGA TATCGCTGAG 540 

TTCTACAAGC GTCAACTAAA ACTTACTCAA GAGTATACTG ACCACTGTGT CAAATGGTAT 600 

AATGTTGGAT TGGATAAGTT GAGAGGTTCA TCTTATGAAT CTTGGGTAAA CTTTAACCGG 660 

TACCGCAGAG AGATGACATT GACAGTGCTC GACTTGATTG CACTATTTCC ATTGTATGAT 720 

GTTCGACrCT ACCCAAAGGA GGTTAAAACC GAATTGACTA GAGACGTTTT AACCGATCCC 780 

ATTGTCGGAG TCAACAACCT CAGAGGCTAC GGAACAACCT TCTCTAACAT AGAAAACTAC 84 0 

ATTCGTAAAC CACATCTATT CGACTATCTG CACAGAATTC AGTTTCACAC GCGGTTCCAA 900 

CCAGGATACT ATGGAAATGA CTCTTTCAAC TATTGGTCCG GTAATTATGT TTCAACTAGA 960 

CCCAGCATAG GATCTAATGA CATCATCACC TCTCCATTCT ACGGAAACAA GTCCTCCGAG 1020 

CCTGTGCAAA ACTTGGAGTT TAATGGAGAG AAAGTCTATA GAGCCGTGGC CAATACCAAT 1080 

CTTGCCGTCT GGCCGTCCGC TGTGTACTCA GGTGTTACCA AAGTGGAATT CAGCCAATAC 1140 

AATGATCAGA CAGATGAAGC AAGTACTCAA ACTTACGACT CAAAGAGGAA TGTTGGCGCG 1200 

GTCAGCTGGG ATTCTATCGA TCAACTCCCT CCAGAAACCA CCGATGAACC TCTAGAGAAG 1260 

GGTTATAGCC ATCAACTCAA TTACGTAATG TGCTTTCTCA TGCAGGGTAG TAGAGGTACC 1320 

ATCCCAGTGT TAACTTGGAC TCACAAGAGT GTAGACTTCT TCAACATGAT TGATTCGAAA 1380 

AAGATTACTC AACTTCCGTT GGTAAAGGCC TACAAGTTAC AATCTGGTGC TTCCGTTGTC 144 0 

GCAGGTCCTA GGTTTACAGG AGGAGATATC ATTCAATGCA CTGAGAATGG GTCCGCGGCA 1500 

ACTATCTACG TTACACCTGA TGTGTCGTAC TCTCAAAAGT ATCGTGCTAG AATTCATTAT 1560 

GCTTCTACCT CTCAGATAAC ATTCACACTA AGCTTGGACG GGGCTCCATT CAACCAATAC 1620 

TACTTCGATA AGACCATCAA CAAAGGAGAC ACACTCACGT ATAATTCATT CAACTTAGCC 1680 

AGCTTCAGCA CTCCATTCGA ATTGTCAGGG AACAACTTGC AGATAGGCGT CACAGGATTG 1740 

AGTGCTGGTG ACAAGGTTTA CATCGACAAG ATTGAGTTCA TTCCAGTGAA CCTTAGGTCC 1800 

CCAGGAACCG AGCTTGAGTT CATCGACATC TAG 1833 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 610 amino acids 

(B) TYPE: anu.no acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:4t 

Met Ala Ala Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys 
15 10 15 
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Asp Val He Gin Lys Gly He Ser Vol Val Gly Asp Leu Leu Gly Vol 
20 25 30 

Val Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe 
35 40 45 

Leu Asn Thr He Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu 
50 55 60 

Gin Val Glu Ala Leu Met Asp Gin Lys He Ala Asp Tyr Ala Lys Asn 
65 70 75 80 

Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Val Glu Asp Tyr 
85 90 95 

Val Ser Alo Leu Ser Ser Trp Gin Lys Asn Pro Val Ser Ser Arg Asn 
100 105 HO 

Pro His Ser Gin Gly Arg He Arg Glu Leu Phe Ser Gin Ala Glu Ser 
115 120 125 

His Phe Arg Asn Ser Met Pro Ser Phe Ala He Ser Gly Tyr Glu Val 
130 135 140 

Leu Phe Leu Thr Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Phe Leu 
145 150 155 160 

Leu Lys Asp Ala Gin lie Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu 
165 170 175 

Asp He Ala Glu Phe Tyr Lys Arg Gin Leu Lys Leu Thr Gin Glu Tyr 
180 185 190 

Thr Asp His cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg 
195 200 205 

Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu 
210 215 220 

Met Thr Leu Thr Val Leu Asp Leu He Ala Leu Phe Pro Leu Tyr Asp 
225 230 235 240 

Val Arg Leu Tyr Pro Lys Glu Val -Lys Thr Glu Leu Thr Arg Asp Val 
245 250 255 

Leu Thr Asp Pro He Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr 
260 265 270 

Thr Phe Ser Asn He Glu Asn Tyr He Arg Lys Pro His Leu Phe Asp 
275 280 285 

Tyr Leu His Arg He Gin Phe His Thr Arg Phe Gin Pro Gly Tyr Tyr 
290 295 300 

Gly Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg 
305 310 315 320 

Pro Ser He Gly Ser Asn Asp He He Thr Ser Pro Phe Tyr Gly Asn 
325 330 335 

Lys Ser Ser Glu Pro Val Gin Asn Leu Glu Phe Asn Gly Glu Lys Val 
340 345 350 

Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val 
355 360 365 

Tyr Ser Gly Val Thr Lys Val Glu Phe Ser Gin Tyr Asn Asp Gin Thr 
370 375 380 

Asp Glu Ala Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala 
385 390 395 400 

Val Ser Trp Asp Ser He Asp Gin Leu Pro Pro Glu Thr Thr Asp Glu 
405 410 415 

Pro Leu Glu Lys Gly Tyr Ser His Gin Leu Asn Tyr Val Met Cys Phe 
420 425 430 

Leu Met Gin Gly Ser Arg Gly Thr He Pro Val Leu Thr Trp Thr His 
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435 440 445 

Lys Ser Val Asp Phe Phe Asn Met He Asp Ser Lys Lys He Thr Gin 
450 455 460 

Leu Pro Leu Val Lys Ala Tyr Lys Leu Gin Ser Gly Ala Ser Val Val 
465 470 475 480 

Ala Gly Pro Arg Phe Thr Gly Gly Asp He He Gin Cys Thr Glu Asn 
485 490 495 

Gly Ser Ala Ala Thr He Tyr Val Thr Pro Asp Val Ser Tyr Ser Gin 
500 505 510 

Lye Tyr Arg Ala Arg He Hie Tyr Ala Ser Thr Ser Gin He Thr Phe 
515 520 525 

Thr Leu Ser Leu Asp Gly Ala Pro Phe Asn Gin Tyr Tyr Phe Asp Lys 
530 535 540 

Thr He Asn Lys Gly Asp Thr Leu Thr Tyr Asn Ser Phe Asn Leu Ala 
545 550 555 560 

Ser Phe Ser Thr Pro Phe Glu Leu Ser Gly Asn Asn Leu Gin He Gly 
565 570 575 

Val Thr Gly Leu Ser Ala Gly Asp Lys Val Tyr He Asp Lys He Glu 
580 585 590 

Phe He Pro Val Asn Leu Arg Ser Pro Gly Thr Glu Leu Glu Phe He 
595 600 605 

Asp I le 
610 



(2) IU FORMAT IOK FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
CAGGATCCAA CAATGAC 17 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
AAGGTAAGT 9 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 



TTTTUTTTUT UTUTUTGCAG C 



21 
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<2) INFORMATION FOR SEQ ID NO: 8: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 



AATTGGGATC 


CAACAATGGC 


TGCAGACAAC 


AACACGGAGG CCCTCGATAG CTCTACCACC 


60 


CCCTAGGTTG 


TTACCGACGT 


CTGTTGTTGT 


GCCTCCGGGA GCTATCGAGA TGGTGGAAAG 


120 


ATGTCATTCA 


GAAGGGCATC 


TCCGTTGTGG 


GTGATCTCCT TGGCGTTGTT GGTTTCTTTC 


180 


TACAGTAAGT 


CTTCCCGTAG 


AGGCAACACC 


CACTAGAGGA ACCGCAACAA CCAAAGCCCT 


240 


TTGGTGGTGC 


CCTTGTTTCG 


TTCTACACTA ACTTTCTGAA TACTATTTGG CCCAGCGGGA 


300 


AACCACCACG 


GGAACAAAGC 


AAGATGTGAT 


TGAAAGACTT ATGATAAACC GGGTCGGAAG 


360 


ACCCTTGGAA 


GGCTTTTATG 


GAGCAAGTGG 


AAGCTTAGAT CTAGCTTCTG GGAACCTTCC 


420 


GAAAATACCT 


CGTTCACCTT 


CGAATCTAGA 


TCTTAA 


456 



<2) INFORMATION FOR SEQ ID NO: 9s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 



AATTAAGCTT 


GGACGGGGCT 


CCATTCAACC 


AATACTACTT 


CGATAAGACC 


ATCAACAAAG 


60 


TTCGAACCTG 


CCCCGAGGTA 


AGTTGGTTAT 


GATGAAGCTA 


TTCTGGTAGT 


TGTTTCGAGA 


120 


CACACTCACG 


TATAATTCCT 


TCAACTTAGC 


CAGCTTCAGC 


ACTCCATTCG 


AATTGTCTCT 


180 


GTGTGAGTGC 


ATATTAAGGA AGTTGAATCG 


GTCGAAGTCG 


TGAGGTAAGC 


TTAACACAGG 


240 


GAACAACTTG 


CAGATAGGCG 


TCACAGGATT 


GAGTGCTGGT 


GACAAGGTCT 


ACATCGGTCC 


300 


CTTGTTGAAC 


GTCTATCCGC 


AGTGTCCTAA 


CTCACGACCA CTGTTCCAGA TGTAGCACAA 


360 


GATTGAGTTC 


ATTCCAGTGA ACCTTAGGTC 


CCCAGGAACC 


GAGCTTGAGT 


TCATCGTGTT 


420 


CTAACTCAAG 


TAAGGTCACT 


TGGAATCCAG 


GGGTCCTTGG 


CTCGAACTCA AGTAGCACAT 


480 


CTAGATCTTG 


TAGATCTAGA 


TTAA 
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We claim: 

1. A descendant plant cell comprising a pesticidal protein 
toxio encoded by a synthetic Bacillus thuringiensis (B.i) 55 
gene, said cell produced by the process of: 

selecting a B.t. pesticidal protein toxin desired to be 
expressed in a plant cell; 60 

obtaining a table indicating codon usage bias for a gene or 
genes more highly expressed in a plant cell than a 
native B.t. gene; 

using said table to design a modified coding sequence 65 
which encodes said protein toxin, whereby said modi- 
fied coding sequence has a frequency of codon usage 



that more closely resembles the frequency of codon 
usage of the plant cell in which it is to be expressed than 
did the native B.t. coding sequence encoding said 
protein toxin, said modified coding sequence having at 
least about 10% of the nucleotides changed as com- 
pared to the native B.t. coding sequence; 

obtaining a synthetic B.t. gene comprising a coding region 
comprising said modified coding sequence wherein 
said coding region is under the control of a plant - 
expressible promoter; 

introducing said synthetic B.t. gene into a plant cell; 

culturing said cell to obtain descendant plant cells or 
plants comprising descendant plant cells, said descen- 
dant plant cells comprising said synthetic B.t. gene; and 
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establishing that said synthetic B.t. gene is expressed in 
said descendant plant cells. 

2. The descendant plant cell of claim 1, wherein said 
modified coding sequence has an A+T base content of less 
than about 60%. 

3. The descendant plant cell of claim 1, wherein the 
process of producing said plant cell contains the additional 



48 



step of determining the frequency of codon usage of said 
modified coding sequence. 

4. The descendant plant cell of claim 3, wherein said 
modified coding sequence comprises an A+T base content of 
less than about 60%. 
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